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Preface 


The aim of this book is to offer a self-consistent overview of a series of is- 
sues relating entropy, information and dynamics in classical and quantum 
physics. My personal point of view regarding these matters is the result of 
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scientists: Heide Narnhofer in the first place, who introduced me to quan- 
tum dynamical entropies and was a precious guide ever since, then Robert 
Alicki, Mark Fannes, Giancarlo Ghirardi, Andreas Knauf, John Lewis, Ge- 
offrey Sewell, Franco Strocchi, Walter Thirring, Armin Uhlmann. To me, all 
of them have been a constant example of rigorous mathematics and physical 
intuition jointly at work. 

Last but not least, my deep gratitude goes to my family and to the many 
friends on whom I could always count for support and encouragement with 
a special thought for Traude and Wolfgang Georgiades. 
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1 Introduction 


This book focusses upon quantum dynamics from various points of view which 
are connected by the notion of dynamical entropy as a measure of information 
production during the course of time. 

For classical dynamical systems, the notion of dynamical entropy was in- 
troduced by Kolmogorov and developed by Sinai (KS entropy) and provided 
a link among different fields of mathematics and physics. In fact, in the light 
of the first theorem of Shannon, the KS entropy gives the maximal com- 
pression rate of the information emitted by ergodic information sources. A 
theorem of Pesin relates it to the positive Lyapounov exponents and thus to 
the exponential amplification of initial small errors, in a word to classical 
chaos. Finally, a theorem of Brudno links the KS entropy to the compress- 
ibility of classical trajectories by means of computer programs, namely to 
their algorithmic complexity, a notion introduced, independently and almost 
simultaneously by Kolmogorov, Solomonoff and Chaitin. 

In a previous book by the author, the notion of quantum dynamical en- 
tropy elaborated by A. Connes, H. Narnhofer and W. Thirring (CNT entropy) 
was presented within the context of quantum ergodicity and chaos. The CNT 
entropy is a particular proposal of how the KS entropy might be extended 
from classical to quantum dynamical systems. 

After the appearance of the CNT entropy, other proposals of quantum dy- 
namical entropies appeared which in general assign different entropy produc- 
tions to the same quantum dynamics. The basic reason is that each proposal 
is built according to a different view about what information in quantum 
systems should mean. Concretely, it is a general fact that, in order to gain 
information about a system and its time-evolution, one has to observe it and 
a quantum fact that observations may be invasive and perturbing. Should this 
fact be considered inescapable and thus incorporated in any good quantum 
dynamical entropy or, rather, should it be avoided as a source of spurious 
effects that have nothing to do with the actual quantum dynamics? 

This is an unavoidable question and, based on the possible answers, one 
is led to different notions of quantum dynamical entropies. These will be 
sensitive to different aspects of the quantum dynamics and thus, not unex- 
pectedly, not equivalent: the real issue is which these aspects are and what 
kind of informational meaning they do posses. 
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2 1 Introduction 


In view of the role of the KS entropy in classical chaos, one of the prin- 
cipal applications of the quantum dynamical entropies has been to the phe- 
nomenology of quantum chaos. The scope has now become wider: quantum 
compression theorems and recent attempts at formulating a non-commutative 
algorithmic complexity theory motivate the study of whether and how the 
different quantum dynamical entropies are related to these new concepts. In 
particular, a better understanding of the many facets of information in quan- 
tum systems may come from clarifying the relations of the various quantum 
dynamical entropies among themselves and their bearing on quantum com- 
pression schemes and the algorithmic reproducibility of quantum dynamics. 

The issue at stake can be conveniently conveyed by an example: the sim- 
plest classical ergodic information source emits bits independently of each 
other with probabilities 1/2 for both 0 and 1. The KS entropy is log2 and 
represents 


1. the information rate of a classical source emitting independent bits; 

2. the Lyapounov exponent of the classical dynamical system consisting in 
throwing a fair coin; 

3. the algorithmic complexity of almost every resulting sequence of tails and 
heads. 


The quantum counterpart of such an information source is a so-called quan- 
tum spin chain, that is a one-dimensional lattice carrying a 2 x 2 matrix 
algebra at each of its infinitely many sites: each site carries a so-called qubit . 
The dynamics of such a system is just the shift from one site to the other and 
the infinite dimensional algebra of operators is equipped with a translation- 
invariant state. These non-commutative structures have recently become of 
primary importance in the boosting field of quantum information. What is 
relevant is that one can construct subalgebras of quantum spin chains char- 
acterized by varying degrees of non-commutativity between their operators. 
Depending on that degree, the CNT entropy, varies between zero and log 2, 
while another quantum dynamical entropy, the AFL entropy of Alicki, Fannes 
and Lindblad, is always log 2. The CNT entropy thus appears to be sensitive 
to the amount of non-commutativity between operators, whereas the AFL 
entropy is apparently independent of that structural algebraic property. 

Because of its unifying properties, the KS entropy can be taken as a 
good indicator of classical randomness and complexity; one would then like 
to assign a similar role to the quantum dynamical entropies. Does this mean 
that, in accordance with the CNT entropy behavior, quantum dynamical 
systems have varying degrees of complexity or randomness depending on 
the degree of non-commutativity ? Or, according to the AFL entropy, the 
algebraic structural properties have no bearing on dynamical randomness or 
complexity, which are rather related to the statistics of such systems, namely 
to their shift-invariant state? 

More concretely, one may ask which one of the two quantum dynamical 
entropies is closer to the actual quantum informational structure of these 
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quantum sources. Regarding this issue, of particular interest are the yet un- 
explored relations of the quantum dynamical entropies to the quantum algo- 
rithmic complexities. 

Indeed, as there inequivalent generalizations of the KS entropy, so there 
are different extensions of the classical algorithmic complexity. These exten- 
sions have been motivated by the possibility of a model of computation based 
on the laws of quantum mechanics and on the theoretical formulation of the 
notion of Quantum Turing Machines (QTMs ). Like Classical Turing Ma- 
chines (TMs ), QTMs consist of a read/write head moving on tapes with, 
say, binary programs written on them. Only, the tapes of QTMs can occur in 
linear superpositions of the classical configurations of 0’s and 1’s. In a word, 
inputs and outputs of QTMs are qubits. 

Since the various quantum dynamical entropies were proposed, indepen- 
dently of quantum information, as tools to better study the long-time dy- 
namical features of infinite quantum systems, one may doubt that relations 
should exist between them and quantum information. One notices, however, 
that the CNT entropy was developed using the notion of entropy of a sub- 
algebra which, years later, independently appeared in quantum information 
theory as a measure of entanglement known as entanglement of formation. 
Also, the AFL entropy is based on techniques that in quantum information 
theory are fundamental tools to describe quantum channels and, more in 
general, all quantum operations that may affect quantum systems. 


The book is organized in three parts. 

In the first part, the first chapter presents basic notions of ergodic theory, 
the second gives an overview of entropy in information theory, the third 
addresses the notion of KS entropy and the classical compression theorems, 
while algorithmic complexity is the subject of the fourth chapter. 

The second part consists of three chapters; the first offers an overview 
of algebraic quantum mechanics with particular emphasis on the notions 
of positivity and complete positivity of quantum maps and quantum time- 
evolutions, both reversible and irreversible. The second chapter introduces 
the fundamentals of quantum information, the relations between positive 
and completely positive maps and quantum entanglement, the entropy of 
a subalgebra, the entanglement of formation and the accessible information 
of a quantum channel. The third concerns infinite quantum dynamical sys- 
tems and quantum ergodicity, quantum chains as quantum sources and the 
quantum counterparts to Shannon’s theorems. 

In the first chapter of the third part, a detailed introduction is given to 
the CNT and AFL entropies and to their use in the study of dynamical infor- 
mation production in quantum systems. Finally, the second and last chapter 
of the book focusses on some recent extensions of algorithmic complexity to 
quantum systems, starting with a discussion of quantum Turing machines 
and quantum computers and concluding with an exploration of the possible 
role played in this context by the quantum dynamical entropies. 


4 1 Introduction 


The topics addressed come from rather different fields that only recently, 
because of the birth and rapid development of quantum information, quantum 
communication and computation have started to overlap. This book has been 
written not as an introduction to any of these topics (of which exhaustive 
presentations do exist in plenty), rather as an attempt to provide readers with 
expertise in some, but not in all of the topics, with a self-consistent overview 
of these many subjects. Therefore, care has been taken to give proofs of 
almost all of the results that have been used, apart from basic and standard 
facts, and to illustrate them by means of selected examples. 
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Classical Dynamical Systems 
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In the first part of the book classical dynamical systems are presented 
from the points of view of ergodic, information and algorithmic complexity 
theory. 

Ergodic theory studies the clustering properties of equilibrium states; in 
information theory the central notion of entropy is used to quantify the de- 
gree of predictability of phase-space trajectories, while algorithmic complex- 
ity theory quantifies their randomness in terms of how easily they can be 
described by algorithms. 

The purpose of this presentation is to set up a suitable algebraic frame- 
work that makes easier the extension of these three points of view to quantum 
dynamical systems. 


2 Classical Dynamics and Ergodic Theory 


In this chapter the term classical dynamical system will broadly refer to one- 
parameter families of transformations, or dynamical maps, T; acting on a 
phase space ¥ whose points x describe the system degrees of freedom. In 
physical applications, x identifies an initial state, or configuration, Tx the 
resulting state or configuration after a span of time of length t. If t is dis- 
crete, t € Z, one speaks of a reversible time-evolution through discrete time 
steps with trajectories {Tx Jez consisting of countably many configurations 
at negative and positive integer times. If t € N, this means that the dynam- 
ics can only develop forward in time and is thus irreversible. In the case of 
a continuous-time dynamics, trajectories through x € ¥ at t = 0 are contin- 
uous sets {T;x}rer of configurations if the dynamics is reversible, otherwise 
trajectories are only forward in time, {Tx her+. 

Once the description of a system by means of a phase-space X has been 
chosen, any phase-point x € Æ contains all possible information about the 
system state. When all this information is not available, the state of a system 
amounts to a normalized positive measure on ¥, a probability distribution, 
such that the volume of a measurable subset gives the probability that x 
belong to it. Entropy quantifies the amount of information corresponding to 
such probability distribution, that is how informative the measure is about 
the actual state of the system. 

Beside the knowledge of the state of classical systems, information can 
also concern how states change in time, in particular, as regards foreseeing 
their behavior; the degree of predictability of dynamical systems is measured 
by dynamical entropies. Intuitively, regular time-evolutions should allow for 
reliable predictions, which are instead hardly possible for irregular dynamics; 
roughly speaking, irregularity is expected to correspond to the fact that the 
past does not completely contain the future. 

Information about the state or the time-evolution of physical systems can 
be obtained by measuring suitable quantities accessible to experiments. These 
quantities, called observables for short, correspond to functions on ¥. Unlike 
for quantum dynamical systems, for classical ones any measuring protocol 
can in principle be assumed not to interfere with the system observed, the 
basic reason being that classical descriptions involve commuting objects, as 
functions on the phase-space ¥ indeed are. 


F. Benatti, Dynamics, Information and Complexity in Quantum Systems, 9 
Theoretical and Mathematical Physics, DOI 10.1007/978-1-4020-9306-7_2, 
© Springer Science+Business Media B.V. 2009 


10 2 Classical Dynamics and Ergodic Theory 


Which observables are appropriate to describe a dynamical system de- 
pends on the structure of the chosen phase-space 1; for instance, statistical 
descriptions require that VY be endowed with a measure-structure, whereby 
measurable functions constitute appropriate observables. On the other hand, 
X might be provided with a topology and typical observables would then 
correspond to continuous functions. 


2.1 Classical Dynamical Systems 


In this section we review some basic facts relative to classical dynamical 
systems mainly adopting a measure-theoretic point of view; in this way a 
minimum of constraints is put on the mathematical properties of states, ob- 
servables and dynamical maps and the emerging technical context is broad 
enough to describe a large variety of physical phenomena, from those typical 
of Hamiltonian mechanics to those better understood in terms of discrete 
dynamical systems. 


Definition 2.1.1. Classical dynamical systems are triplets (X,T, u), where 


1. X is a measure space with an assigned a-algebra X of measurable sets; 

2. T is measurable, that is A € X > TTA) € X; 

3. X is endowed with a T-invariant, positive normalized measure u, such 
that (X) = 1 and po T7! = p. 


Remarks 2.1.1. 


1. A collection X of subsets S C ¥ is called a measure-algebra if 1) ¥ € X, 
2) S € X implies ¥ \ S € X, where Sı \ S2 denotes the complement of 
the subset Sə relative to the subset S1, and 3) S; € X for i=1,2,...,n, 
implies Lj_, S; € X. A measure-algebra X is a measure o-algebra if it 
is closed not only with respect to finite unions of its elements, but also 
with respect to countable unions, that is if JL} Sn € X for all {S,,}9@,, 
Sn E X. Since the complements of unions of sets are the intersections of 
the complements of the sets, namely æ \ (AU B) = (4X \ A) U (£ \ B), 
o-algebras contains infinite intersections of their elements, too. 

2. Let Xo be a measure-algebra, by adding to Xo infinite unions and inter- 
sections of elements of Xo one obtains a o-algebra X which is the smallest 
one containing Xo; such X is called the o-algebra generated by Xo. If the 
measure space ¥ is endowed with a topology, then, the g-algebra gener- 
ated by the open subsets is known as Borel o-algebra and its elements as 
Borel sets [258]. 

3. A positive function u : X ++ Rt, such that (a) = 1 is a probability 
measure on 4 relative to a o-algebra X if it is o-additive, namely if 


Aa 


On 
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Esse snsi=0=a(Ü s.) = X MSh). 


n=1 n=1 


Notice that u is automatically monotone under inclusion, namely 


AC B = A = (A\ B) U B = p(A) = „(A \ B) + p(B) > u(B) . 


. The following criterion is rather useful: an additive positive finite map 


u: X |> R* is ø-additive if and only if lim, (Bn) = 0 for any collec- 
tion {B,}°2, of sets Bn € X such that Bn+ı C Bn and (), Bn = 9. 
Indeed, suppose p is -additive and {Bn}; has decreasing properties 
and empty intersection; then, the sets Cn := Bn \ Bn+1 are disjoint and 
Bn = U,>n Cr- It thus follows that u(B1) = Xz; w(Cx), whence 


CO 


lim p(B,) = lim w(C,) =0. 


n—0o 
k=n 


Vice versa, let u be positive, finite and additive on X and take any col- 
lection {C,,}°2, of disjoint subsets of X; because of additivity 


„(Ü a) = Yon) + u( Ù Cr) 


k=n+1 


Since Bn := URina1 Ce © Bn-1 and Ap Bn = 0, o-additivity follows. If 
uis o-additive over a measure algebra Xo it can be extended in an unique 
way to the o-algebra X generated by Xo. In other words, given a S € X, 
for any € > 0, there exists S” € Xo such that u(S AS’) < £, where 


GAs = (9. 8) U (S! 8) Sus) \ (SA S’). (2.1) 


. A regular Borel measure on ¥ is a measure on the Borel o-algebra such 


that, for any measurable subset B and £ > 0, there exists an open, U+, and 
a closed subset, Cz, with Ce C B C Ue such that w(Uz\C-) < £ [258, 313]. 


Definition 2.1.1 provides an appropriate framework for irreversible dy- 


namical systems in discrete time whereby the time-evolution of phase-points 
x € & consists in successively applying the dynamical map T to x so that 
trajectories are given by countable sets {Tx}, cn. For reversible, discrete- 
time dynamics, also T7! is assumed measurable, that is T(A) € © if Ae X 
with uo T = p; trajectories are then of the form {Tx} nez. 


The measure u defines a probability distribution over X: if f : X + R is 


a measurable function (an observable of the system), its mean value is 


af) = I du(st) f(a) . (2.2) 
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In particular, if A C ¥ is a measurable subset and 1,4(x) its characteristic 
function t, the volume 


p(A) = l dae (2.3) 


has a natural interpretation as the probability that x € 4X belong to A. 
We shall as well refer to these probability distributions as to the states of 
a classical dynamical system. In fact, in the case of a continuous phase- 
space, access to phase-points is practically never achievable; thus, one has to 
content oneself with the knowledge of how phase-points are distributed over 
X. From a physical point of view, the fact that states u are assumed to be T- 
invariant means that the statistical description of dynamical systems refers 
to equilibrium states. Interestingly, a measure-theoretical dynamical triplet 
can be represented in terms of a unitary operator on a Hilbert space [17, 61]. 


Example 2.1.1 (Koopmann-von Neumann Formalism). [175] 


Let (¥,T,) be a measure-theoretic dynamical triplet. Finite additions 
and multiplications of characteristic functions of measurable subsets A; C ¥ 
give the algebra G(X) of simple functions s = |; ci 1a, over X. Lebesgue- 
integration with respect to u defines a scalar product (s1 | s2), over G(%), 


(si 2 )u2= Do (eb) Ff TOTO ORNO WAP 0 43), 
X J 


ij ij 


for 14(x)l g(x) = lang(x). Further, by linearly extending the map defined 
by 14 Ur 14 :=140T = 1r-1ı(4), one gets a linear operator Ur on G(4). 
Since uo T7! = u, Ur preserves scalar products 


(Urs: |Ursa)u = XO (4) oj a( TA} n A2)) = (511 52) y - 
ij 
Therefore, the Koopman operator Ur can be extended to an isometric im- 


plementation of the dynamics (invertible and thus unitary in the reversible 
case) on the Hilbert space L? (X) of square-summable functions on ¥, 


(Urp)(x)= (Tr) WeEL?(X), reg. (2.4) 


The spectral properties of Up will turn out to be of particular relevance for 
ergodic theory (see Section 2.3). Using a bra-ket quantum like notation, we 
observe that: 


1. the identity function I(x) = 1 almost everywhere with respect to u, is 
always an eigenvector of Up with eigenvalue 1, Ur| 1) =|loT) =|T); 


t 1a(a) =1 if @ € A, 14(x) = 0 otherwise 
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2. if 1 is a degenerate eigenvalue, then there exist constants of the motion 
14 y €L2(4), Ury) =|boT) =|¥); 

3. mean values are scalar products, u(y) = (1|w), for all y € L3 (4); 

4. products of mean values amount to the matrix elements of the orthogonal 
projection | 1)(1| 


u(y) wd) = (YIII), Vy, o ELLE), (2.5) 


where w* is the complex conjugate of ~. 


Hamiltonian Mechanics 


Hamiltonian mechanics is an important source of classical dynamical sys- 
tems [16, 17, 299]. Systems with f degrees of freedom are described by 
a phase-space which is a 2f-dimensional manifold My C Rf x Rf whose 
points r = (q,p) consist of positions q = (q,...,qf) € Rf and momenta 
p = (pı,... pf) € Rf. The phase-space inherits a symplectic geometry from 
e 3 ) where Oş and ily are the 
f x f zero and identity matrices, respectively. Via the symplectic matrix one 
defines the Poisson brackets of two (differentiable) functions F, G : My > R, 


the symplectic matriz J := [Jij] = 


Tage! eo ae (2.6) 


t,j=1 


With respect to them, q and p are canonical coordinates: {qi , pj} = 6;; and 
the time-evolution is generated by the Hamilton equations 


Of py, 2 Bien 


where H = H(r) is a (time-independent) Hamiltonian or energy function of 
the system. They are solved by the Hamiltonian flux r +> r(t) = OF (r), 
t € R °. The time-evolution of functions F on My then amounts to a group 
of dynamical maps F +> F; := Fo@F that solves the time-evolution equation 


dFi(r) 
dt 


={F,, H}(r). (2.7) 
Suppose My = Rf; then, a natural o-algebra for the phase-space M f 
is the Borel o-algebra (see Remark 2.1.1.2) containing all open subsets of 


?One can always extract a discrete time-evolution {T7"}ncz from it by fixing 
t= 1 and setting T := F. 
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the topology of My given by the Euclidean distance. The Liouville mea- 
sure dr = Tey dqidp; is invariant under the Hamiltonian flux 6”; however, 
f M; dr diverges and cannot be normalized to a probability distribution. A 
way out typically occurs when there are constants of the motion, that is func- 
tions F on Mp, like the Hamiltonian itself, such that {F , H} = 0. By fixing 
their values, the dynamics is restricted to time-invariant submanifolds that 
usually have finite volumes. Instances of equilibrium states leading to descrip- 
tions of Hamiltonian systems as measure-theoretical triplets (Mp, 8}? , un) 
(discrete time), or (My, {ĐE her, ua) (continuous time), are in general pro- 
vided by probability distributions dug (r) = f(r)dr, where f : Ms |= Rt is 
a normalized, positive functions such that {f , H} = 0. Prominent instances 
of such probability measures are the micro-canonical, canonical and grand- 
canonical states of classical statistical mechanics [300]. 

The time-invariance of states as the previous ones deserves to be exam- 
ined in some more detail as it follows from a duality argument which we 
shall frequently encounter in the following. Duality is essentially the obser- 
vation that the mean value of a function F at time t, F, with respect to a 
state u equals the mean value of F with respect to the state u+ at time t, 
u(F:) = u(F). This defines the time-evolution of states as the dual of the 
time-evolution of observables (functions); indeed, from time-invariance of the 
Liouville measure it follows that 


u(F)= | dr u(r) FGF (r)) = | dr wO2,r) F(r) =: m(F), (2.8) 
Mş My 


whence u+ := uo PE, solves the time-evolution equation 


PH) L mehr) (2.9) 


Example 2.1.2 (Regular Motion). Consider two uncoupled harmonic 
one-dimensional oscillators described by r = (q1, q2, p1, p2) E€ Mz = R* and 


by the Hamiltonian 
Bh p? i mals p 
3 °°." Dig S 
—— ——— ———_—_’ 
Hı(r) Hə(r) 


By fixing the single oscillator energies H;(r) = Ei, i = 1,2, the motion 
develops on the 2-torus T? := {0 = (61,02) : 6; € [0,27)}, where it amounts 
to a two-dimensional rotation. Indeed, setting 


QJ; ; 
qi = cos 6; > Br vV 2mjw; Jy sin 0; 5 


MiWi 
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one gets angle-action variables (0, T), 0 = (01,02), T := (hh, I2). 

These are canonical coordinates, with Poisson brackets {6;, Jk} = dix} 
moreover, H(r) = K(J) = w1 J, + w2J2. Thus, the corresponding Hamilton 
equations, 


dé dJ 
a" Pe w = (w1,W2) , 
are solved by the Hamiltonian flux 
T,: 0+ O(t) :=7;(0) =O4+wt. (2.10) 


By varying E,,E£2 and thus I, the phase-space R* is covered by non- 
intersecting 2-dimensional tori. On each fixed torus, d@/(27)? gives a prob- 
ability measure which is invariant under the Hamiltonian flux. The triplet 
(T?, T := T}, d0/(27)?) fulfils the requirements in Definition 2.1.1. 

In the Koopman-von Neumann formalism, the unitary operator Ur im- 
plementing T on H := L?(T?,d0/(27)?) has the exponential functions 
en(0) = exp(in- 0), n € Z?, as eigenfunctions , 


(Uren)(8) = en(O + w) =e! Vja1 905443) = et Eja ne en(0). (2.11) 
Therefore, the time-evolution of H > |W) = X nez W(n)| en) is given by 


|b) = URI) = X pin) EI en), REZ. (2.12) 


neZ? 


Remarks 2.1.2. 
LP A p,q € N, trajectories close since 0(2qr /w1) = 0 mod 27. 
Ww 


2. If Hake are no 0 Æ m2 € Z such that njw; + ngw2 = 0, then, every 
trajectory {0(t)}:cr fills the 2-torus T? densely. Namely, for any ¢ > 0, 
,0 € T?, there is t € R such that ||@(t) — @)|| < £, where the norm is 
the Euclidean norm computed mod 27. Indeed, using (2.10), 


ty := (61 — 01) /wy => 01 (ty + 2n7/w 1) = dı mod 2r , 


for all n € Z. Since T is compact, the sequence {62(t. + 2n7/w1)}nez has 
accumulation points; thus, for any e€ > 0 there exist n,p € N such that 


[ot +2(n + p)r/w1) — 02(tx + 2n /w)| = 2pr— mod27 <e, 
1 


whence the sequence {62(t, + 2npr/w,)}nen subdivides the circle into 
disjoint intervals A,, of length 


[ott + 2(n + l)pr/w1) — 02(t. + 2npr /w1)| <e. 
Therefore, 


$2 € Am = ||0(te + 2mpr/w1) ~ oll = |B2(t. + 2pmm/w1) — do| < €. 
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3. A similar argument as before shows that, in discrete time, trajectories 
{O(n)}nez fill T? densely if and only if there are no integers nı 2 # 0 
such that niwi + now = 2rp with Z > p Æ 0 [91]. 

4. Example 2.1.2 is a particular instance of the Liouville-Arnold theo- 
rem [16, 17, 299] on integrable Hamiltonian systems. Suppose a canonical 
system with f degrees of freedom possesses f global constants of the mo- 
tion K;, Kı := H in involution, that is {K;, Kj} = 0, i,j = 1,2,...,f. 
If the subset Ny := {K;(r) = k; : i = 1,2,..., f} C My is compact and 
connected and the differential 1-forms dK; are linearly independent on it, 
then Nķ is isomorphic to the f-torus Tf. Moreover, there exists a canon- 
ical transformation from r € Ny, to angle-action variables (0, J) such 
that the Hamiltonian flux PF is isomorphic to an f-dimensional rotation 
on Tf with J-dependent frequencies: O(t) = 0 +w(J)t. Accordingly, the 
phase-space Mẹ foliates into disjoint f-tori which are filled densely by the 
trajectories {0(t)}ier when — nw (J) = 0, n; € Z, only if all n; = 0. 
Tori such that SL niwi(J) = 0 for 0 Æ ni € Z are called resonant and 
on them trajectories close. The independence of the oscillation frequen- 
cies w from the actions J in Example 2.1.2 is an exception due to the 
linearity of the Hamilton equations. 


Integrable Hamiltonian systems cannot behave too irregularly as their 
motion amounts to a multi-dimensional rotation over invariant tori. In order 
to increase the degree of irregularity, some constants of the motion must 
disappear in order to let the trajectories wander around according to less 
predictable patterns. In the following example, a constant of the motion is 
eliminated by means of a folding condition. 


Example 2.1.3 (Hyperbolic Behavior). [17, 271] 

Let ôp(t) denote the periodic delta function } „ez 6(n—t) with unit period 
and consider a free one-dimensional motion with periodic quadratic kicks, 
occurring with strength @ € R, according to the pulsed Hamiltonian 


H= 5(°° +OP): 


A natural dynamical map T consists in updating the vector r = (q, p) on 
phase space from immediately after the n-th kick to immediately after the 
n — i-th one; namely T : Tn > Tn41, where Tn := (qn, Pn) and 


qn := lim q(n+€£), Pn:= lim q(n+e). 


e—0+t e—0+t 
Integrating the Hamilton equations 


dq _ dp _ 
ae e q Or) Ba 
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first between Tn + € and T(n + 1) — € and then between T(n + 1) — © and 
T(n + 1) +e, yields 


n+l1+e 
q(n+1+e)—q(n+e) =p(n+e) +f ds p(s) 


p(nt+1+e)—p(nt+te)=—Sq(n+1). 


By letting £ — 0*, the integral is of order £ and vanishes; thus, the dynamical 
map T reduces to a 2 x 2 matrix acting on R?: 


r=(4) oar, a=(1, oar (2.13) 


Since det(A) = 1, the Liouville measure dr = dqdp is T-invariant. The 
eigenvalues of A, 


+1_2-btvVp-4) __2-8tvy-2)-4 


om = = ’ 


2 2 


are real with |a| > 1 when 8 < 0 or 8 > 4. The corresponding eigenvector 
|a,) identifies a direction in R? along which lengths increase exponentially 
for n > 0, while they contract exponentially along the direction of the eigen- 
vector | a_ ) relative to the other eigenvalue |a|~! < 1. This motion is called 
hyperbolic. 


For 6 = —1, A = (; >) is symmetric thus (a_ |a} ) = 0 and, writing 
Ir) = a+) +4] a_), 


Irl? = byPe2 8% + [6Pem 2984, (2.14) 


where rn := A”r. Therefore, the norms of all vectors r Æ 0 increase ex- 
ponentially while remaining on the hyperbolae selected by fixing a value of 
F(r) := q? — p° + qp. Indeed, one can directly check that F(rn+1) = F(rn), 
whence this function is a constant of the motion [118]. This is no longer true 
if one imposes a folding condition that forces the dynamics to develop on the 
two-dimensional torus T? := {R? > r = (q, p) mod (1)}, namely if one defines 
the dynamical map 


Ta: T? > rt ry := (A"rmod1)€T?. (2.15) 


Then, the resulting triplet (T?, Ta, dr) is as in Definition 2.1.1 and the map 
T is known as Arnold Cat Map [17]. 

More in general, one may consider the dynamics on the 2-dimensional torus 
T? generated as in (2.15) by a matrix 


A= (: a , a,b,c,dEZ: ad—be=1, ja+d|>2, (2.16) 
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with eigenvalues at! € R. Since A need not be Hermitian, its normalized 


eigenvectors | a+) = e are in general only linearly independent; one 
2+ 


explicitly computes 


1 
ait = bAs , a24 = (a®! —a)A} where A4 := F 
1 2 ( ) ~ — Jb? + (att =, a)? 
(2.17) 
and expands R? 5 |r) = (=) = C4 (r)|a+) + C_(r)|a_) with 
pe Ta2— — Yaı— = Yai4 — TAQ+ 
Cy(r) = — zy C_(r): aoe aaa (2.18) 
where 
A := Det Ga ~ =b(1 — @?) 44A. (2.19) 
ag+ Q2— 
Then, the hyperbolic behavior shows up since 
A®lr) =a" C4(r)|a}) + a *®C_(r)|a_) (2.20) 


d+ d)? — 4 
and the absolute value of one of the eigenvalues a4 = on Ca) 


is larger than 1. 
Consider now the Koopman operator Ua on H := L4,.(T?); the orthogonal 
exponential functions 
en(r) := exp(27in- r) (2.21) 


are such that (AT denotes the transposed of A) 
(Usen i = POO) = OO) = ena (r) , (2.22) 
whence, setting Y(n) := (en |Y) for all y € H, it turns out that 
(Uaw)(n) = (en |Ua |b) = (ea-tn |b) = YAn) . 


Therefore, Ua has no other eigenvector but e9 = 1: if Ualw) = u| y) for 
 €H with |u] = 1; then, with |b) = Znez HM) en), 


(em |U%b) = X b(n) (em learn) = P(A Pm) = Pol) , 


neZ? 
for any fixed m € Z?. Since Y(n) > 0 with ||n|| — oo, if (m) £ 0, then 
lim, ~(A~?m) = 0 because of hyperbolicity, while yPy(m) oscillates. 


The exponential amplification of small errors that results from (2.14) 
(or form (2.20)) cannot hold for arbitrarily large n: In fact, ||rp|| < V2 
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so that the expansion is eventually counteracted by the folding condition 
in (2.15). Suppose |r) = ceļa), then ||r,|| = ce” 28% < v2 increases until 
n < log(e~1V/2)/(log a). 

This argument applies to any pair of initial conditions rt:?; their distance 
||r1— r?|| increases exponentially due to the expanding contribution from the 
component of r! — r? along |a;) until the folding condition affects one of 
the two cartesian components of r! — r?. Notice however that the smaller 
is ||r! — r?||, the longer the amplification lasts. This observation allows the 
introduction of the notion of asymptotic divergence rate of initially close 
trajectories even when they develop on compact phase-spaces: these rates are 
known as Lyapounov exponents and are a measure of dynamical instability. 


Definition 2.1.2 (Maximal Lyapounov Exponent). [199, 106] The 
maximal positive Lyapounov exponent of a dynamical triplet (V,T, u) equipped 
with a distance d(x,y) is defined by 


. 1l aT”; Ty) 
AM = | = l log =——— > 
wee nee” e 


Of course ¥ may be a multi-dimensional space and thus there might be 
more directions along which distances expand exponentially fast with ex- 
ponents A(x) > 0; the intuitive picture behind the definition is that, for 
sufficiently small d(x, y), the distance at time n is such that 


d(T 2, Ty) = (2) d(x, y) (1+ Oene), 
where A(x) < Am(x) [62]. 


Remarks 2.1.3. 


1. A rigorous approach to Lyapounov exponents can be found in [199]; here, 
we sketch a few basic facts (see [106, 313]). Assume the phase-space æ 
to be a compact manifold with a Cə differentiable structure, a Borel o- 
algebra and a Riemannian metric such that the tangent spaces T,(4) at 
x € X are isomorphic to R? equipped with an Euclidean structure. The 
dynamics T : X +> X is assumed to be continuous with continuous first 
derivatives, so that one can focus upon its linearization 7,(T) that maps 
the tangent space 7,(4’) into the tangent space Trz(A’). In particular, 
one is interested in the asymptotic behavior of ||7,(T”)|| where, by the 
chain rule, 

Tal T”) = Tpn-1g(T) © Tpn-2y © `- -Tg(T) . 


Let X be equipped with a T-invariant regular Borel measure u (see 
Remark 2.1.1.5); then, there exists a measurable subset B C 4 with 
u(B) = 1 and a positive measurable function s : B +> R+ such that, given 
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x € B, there are real numbers {AM (x) BO), AD (a) < AG+D (x), and lin- 
ear subspaces of R*, VOEG, VO = {0}, V(r) c VOY (g), 
V@) = RF, for which 
1 > ; 

a) lim -log ||ra(T®)r]| = AP (x) for all r € W;(æ) := VO (£) © 

no n 
VG-D (x); 
b) G(x) is defined, measurable and T-invariant on the subset of x € B 
such that s(x) > j, that is AP (Tx) = AD) (x); 
ce) Te(T)V (x£) c VO (Tx) for all j < s(x). 
It thus follows that, if AP) (x) < 0, the norms of all r € V(x) go to 0 


exponentially fast with n — +00. On the other hand, if \) (x) > 0, the 
norms of all vectors r € VO) (x) © VOH (x) diverge exponentially fast. 


. There can be more than one positive Lyapounov exponent thus more than 


one amplifying direction in space. In volume-preserving dynamical sys- 
tems to any amplifying direction there corresponds a shrinking direction 
(amplifying in the past). 


. On compact manifolds, the two limits in Definition 2.1.2 do not commute: 


the numerator is limited by compactness, whence the 1/n limit vanishes 
if performed before letting d(x,y) — 0. 


. If there is an intrinsic smallest distance ô > 0 between points x,y € ¥ and 


the largest possible distance A is finite, then the Lyapounov exponent is 
zero. This means that exponential separation or amplification cannot be 
extended beyond the logarithmic time-scale set by 5e*! < A. This gives 
A 


1 
a so-called breaking-time [118] tg := ` log F 


. When the motion develops on a compact phase-space, the existence of 


positive Lyapounov exponents is known as eztreme sensitivity to initial 
conditions and provides a widely accepted definition of classical chaotic 
motion [271, 228]. Notice that without the folding condition, also an 
inverted harmonic oscillator with Hamiltonian H(r) = p?/(2m)—mwg? /2 
would show an exponentially fast separation of initial conditions, though 
far less irregular and interesting than one on a compact manifold. 


2.1.1 Shift Dynamical Systems 


Phase-spaces with a finite or a countable number of states are typical either 
of systems which arise from suitable discretizations of otherwise continuous 
phase-spaces or of intrinsically discrete systems as cellular automata [62]. The 
first possibility arises in particular when the observations aimed at identifying 
the system state as a point of phase-space have a finite accuracy; then, one 
performs a coarse-graining of phase space into a certain number of regions 
whose volume is determined by the given accuracy and whose interior points 
are accessible only through observations of higher accuracy. As we shall see 
in later sections, in such a case, the system states are identifiable with the 


2.1 Classical Dynamical Systems 21 


labels of the regions where the system state is localized and the dynamics 
corresponds to jumping from label to label rather than from point to point 
of the phase-space. 

Instead, the phase-space of cellular automata [62, 18] is discrete from the 
start as they consist of copies of a same system (automaton) described by 
a d-valued function i, for instance, in the binary case i = 0 may be used to 
signal when an automaton is deactivated, i = 1 when it is activated. 

The phase-space ¥ of a cellular automaton with N systems comprises d% 
configurations (states) corresponding to finite strings i) = (i1, i2,...,in) € 
A) := {1,2,...,d}%. The dynamics is given in discrete time by a map 
T: R a R that updates the configurations from time n to time n +1: 
i) (n) = i®)(n+1). The state ip(n+1) of the k-th automaton at time n+1 
in general depends on the states of some or all other automata at time n. In 
the following, we shall focus upon a most simple class of cellular automata, 
that is shift dynamical systems [17, 61, 164, 313]. 

Let the space ¥ be the collection Ra := {0,1,...,d}% of all sequences 
i = {i;}j;en of symbols from a finite alphabet 7; = 1,2,...,d. Each į can be 
interpreted as a configuration of a countable network of cellular automata, 
each of them being indexed by an integer j € N, with 2; denoting its actual 
state among the d possible ones. Let Tp : Ra — Na be the left shift along 
sequences, 

(Tot); = tj41 5 (2.23) 
and set i(n) := T?4: Tọ amounts to a rather trivial dynamics, namely to a 
deterministic updating whereby the state i;(n+ 1) of the j-th automaton at 
time n+ 1 depends only on (is equal to) that of its right nearest neighbor at 
time n: 

ij(n +1) = (Tgi); = (Tot) j(n) = ij+1 (n) . 

From the point of view of a fixed automaton, say the 0-th one, this kind of 
dynamics is typically like tossing a coin. Indeed, suppose the initial configu- 
ration 7(0) of the network is to be chosen randomly, according to a probabil- 
ity distribution where all automaton states occur with the same probability 
27N. Because of the dynamics, this property is then inherited by the sequence 
{io(n)}nen of successive states of the 0-th automaton. 

In order to provide the shift along binary sequences with a measure- 
theoretic formulation as in Definition 2.1.1, the set Q4 of infinite sequences 
has to be equipped with a o-algebra of measurable sets. The standard way 
to do this is by means of the so-called cylinders [61, 164, 91, 17]; they consist 
of all sequences whose entries have fixed values within chosen intervals: 

OPN cig = {ie Oy Piet) £=0,1,...,4— 5} . (2.24) 
— 


i(k-j+1) 


They are labeled by the interval [j,k] and by the binary string i*-I+}) of 
length k — j + 1 of assigned digits within that interval; each one of them can 


22 2 Classical Dynamics and Ergodic Theory 


be obtained as a finite intersection of simple cylinders ot, 


CH a= n OY. OP Ne mausit: (2.25) 

We shall denote by Cjj,4) the sets consisting of the 2°°-J+)) cylinders 
on} j+- Lhe o-algebra X is obtained from all possible unions and intersec- 
tions of simple cylinders. Further, pre-images of cylinders under T~! remain 
cylinders: in fact 


(om) = {i EMRh: Tie a} = {i Paes hans ee ie} 
= cf , (2.26) 


whence T, is measurable with respect to X. 


Remark 2.1.4. The left shift on unilateral sequences is not invertible; it be- 
comes so by choosing instead of N24 the set Q7 of all doubly infinite sequences 
i = {i;j}jez. Then, the same result as in (2.26) holds for the pre-images of 


cylinders under Ts, TCE) = oy, whence T7 * is also measurable. 


We shall refer to any probability measure u on X as to a global state 
on Nq; to any such u there correspond local states ujj) on the cylinder 
sets Ci;,,]- As cylinders in C;; ;,) are in one-to-one correspondence with strings 
iit!) c Ok +) of length k — j + 1, these local states are probability 


distributions on oe +1), 


_ k—j+1 
Hij,k] = {p Pri ny (i wd Se 
d 


pum (e*) > 0, 5y pat OY) =1. (2.27) 


(kh—j (k—-j+1) 
it-it) Een 


Consider the sequence of local states {u™ } nen, 


pm) = POEmo , p (i”) = Pini”) G (2.28) 
on the cylinder sets Cj, ,); since Gt PA -Ù or ., from the additivity 


of the measure the following compatibility coded follows 


d 
p™ (izig...in) = plete a) = Sop (iiz... ini) . (2.29) 
i=1 
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Particularly important global states over q correspond to shift-invariant 
probability measures u, po T>+ = u. From 


pO ae es 2) Sai. 


it then follows that 
d d 
Ty fous ; ln 2,n = l,n—1 
ya (iiz sein) = Salake) = oe) = mee we n) 
i=1 i=1 


=p opor =p Migein) (2.30) 


As a consequence, if u is shift-invariant the probabilities assigned to cylinders 


C i depend only on the values 7;7;41...%% defining the cylinder, but 


not on the interval [j, k]. 


Remark 2.1.5. Interestingly, the conditions (2.29) and (2.30) defines a dy- 
namical triplet (Qa, To, u) in the sense of Definition 2.1.1. This is the content 
of Kolmogorov representation theorem [266]: if © = {1,2,...,d}, the set Qa, 
as the infinite Cartesian product ¥*% of countably many copies of ¥ can be 
equipped with the product topology which is the coarsest one with respect to 
which the projection maps 7; : t > tij are continuous, namely the one gener- 
ated by union and intersections of preimages T; (B) of sets B € X that are 
open with respect to the discrete topology of X. Then, N4 is a compact set by 
Tychonoff theorem [251]. Namely, any open cover of 92 also contains a finite 
subcover, whence in any collection of closed sets in 2 with empty intersection 
there also exists a finite sub-collection with empty intersection [251]. 


Suppose one is given a collection of numbers pr) (4) as in (2.28) sat- 
isfying (2.27); they assign volumes py) -= p™(é™), and define local 
states on the measure algebras generated by these cylinders. If the quantities 
p™ (é™) fulfil (2.29), the local states extend to a positive, finite and addi- 
tive function u on the o-algebra X generated by cylinders. In order to show 
that u is also o-additive and thus a measure, one uses Remark 2.1.1.4 and 
that each set in X the o-algebra is closed in the product topology. Therefore, 
given any decreasing sequence X D {C;,}°2, with empty intersection, com- 
pactness ensures that there exists a finite sub-collection {C}, }¥—; such that 
Mar Cn; = Ø, whence limpn_.oo(Cn) = 0. 


Further, suppose that the quantities p”) (i) also fulfil (2.30), then it 
turns out that 


1142...05-1 


XO pD aa ade) 


1g...0j—1 
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for all £ = 1,2,...,7 — 1. Therefore, p( OF, | = p®-I+D (i; ... ig) whence 
Sji [j,k] = [j,k] . apr m 
ii (er) = wae.) and the measure p is shift-invariant. 


Example 2.1.4 (Bernoulli shifts). Consider a shift dynamical system 
(Qa, Ts, u); the simplest choice of local states pu”) corresponds to product 


measures on a”, 


p (i, +++ in) = Let» , pli) >20, > rl) =, (2.31) 


These dynamical triplets are known as Bernoulli-shifts; if d = 2 and ¥ = 
{0,1}, (Q2, To, 4) amounts to repeatedly tossing a coin, possibly biased if the 
probabilities of head (0) and tail (1) are different. 


Example 2.1.5 (Markov Chains). Shift dynamical systems slightly more 
correlated are Bernoulli A are the so-called Markov shifts. Given the 
local states u“ = = {p™(il Mnp the ratios 

d 


p™ (iriz +++ tn) 


pO) (iyig +++ in—1) 


(2.32) 


Dlin|tite->* tpi) := 


define conditional probabilities for the n-th symbol to be in if the previous 
n — 1 ones are 71---+%,-1. The global state u is said to possess the Markov 
property if and only if the following conditions occur: 


P(in|irte che i in— 1) = Plinlin—1) (2.33) 
d 

> Pils) = 1 (2.34) 
p 4=1 

x: P(t) PG) = p(i) - (2.35) 


Condition (2.33) means that the conditional probabilities (2.32) depend only 
on in and on 7,_; and not on the previous symbols, so that 


piriz: in) = (TI plGesalie)) pli) - (2.36) 
g= 


Therefore, local states u”) are completely specified by the d x d matrix 
P = [p(in|in—1)] and the probability vector |p) = t) 
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Because of (2.34), the matrix P is a stochastic matrix, namely its entries 
p(il|j) are positive and qualify as transition probabilities as they express the 
fact that the system cannot but remain in the same state or change into 
another one. It follows that condition (2.29) is satisfied, indeed from (2.36) 


d d 
Sp (iini) = X- plil ilin—1 (Hr iesılie))p(i j1) 
i=1 s= 


- (I> (icsilée)) pl) = PY (iia ina) - 


Further, because of (2.35), the probability vector is an eigenvector with eigen- 
value 1 of the matrix P and (2.30) is also satisfied, whence the local states 
p™ generate a global shift-invariant states on 94. In fact, 


d n-1 d 
YP (iis in) = (JI pieli) X pildo) 
1 l=2 i=l 
= (TI plies lie) pliz) = 2° nisin) 
l=2 


Notice that Bernoulli shifts are particular instances of Markov chains with 
transition probabilities p(i|j) = p(i) for all j = 1,2,...,d. 


? 


2.2 Symbolic Dynamics 


As already remarked, states corresponding to a continuous phase-space can 
only be identified with finite precision that is they can be located within 
subsets of small, but finite size, and cannot be further resolved. A typical 
case is when the finite accuracy available corresponds to the subdivision of 
the phase-space in a finite number of non-overlapping measurable subsets, 
namely to a coarse-graining of the phase-space X by means of a so-called 
finite partition [7, 167]. 


Definition 2.2.1 (Partitions). 


1. A finite, measurable partition (partition for short) P of (X,T, u) is any 
collection of measurable subsets P; C X, i € Ip, Ip an index set of finite 
cardinality, such that P,P; = fori # j and | P; = & The subsets 
P; are called atoms. 

2. A partition P = {Pi} ier, 1s finer than a partition Q = {Qj Jero (Q 
coarser than P), symbolically Q < P, if the atoms of Q are unions of 
atoms of P: Qj = Uietjcrp P,, for all j € Ig. 


ielp 
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3. Given two partitions P = {Pi}ier, and Q = {Qj}je7,, the partition 
PV Q={PiNQ;hierp,jelo ts the coarsest refinement of P and Q. 


Example 2.2.1. [61] Quite often, ¥ is endowed with a o-algebra X which is 
generated by a measure-algebra Xo; it is then possible to approximate within 
£ any finite ©-measurable partition P = {P,}¢_, by a finite Y-measurable 
partition Q = {Q;}%_, with atoms Q; € Xo, in the sense that (see (2.1)) 
u(P,AQ;) < £, i = 1,2,...,d. Indeed, because of Remark 2.1.1.4, given 
ô > 0, for any P; € P one can find Qj € Xo such that u(P; AQ‘) < 6; notice 
that P; N Pj =0, thus x € Q; N Q} and z ¢ P; yield x € Q; A Pj, whence 


QN Q} E Qi A PiU Q} AP; = MQN Q4) < 26. 
The sets Q; need not form a partition; however, let Q’ := Wes Qi N Q}, 
which is such that u(Q') < d(d — 1) ô and set 


Qi:=Q\Q , i=1,2...,d-1, Qu=X\U Q. 


These are atoms of a partition Q C Xọ. Consider first the symmetric differ- 
ences Q; AP;, i = 1,2,...,d — 1; one has that, if x € Q; and z ¢ P;, then 
x € Q; AP;, while, if x € P; and z ¢ Qi, then z E€ Q; AP; or z e BAQ, 
whence 


Q; AP; C Q' U (QAP) = p(Q: AP) < (dd = 1) + 1)6. 


Since Py = ¥ \ US=} P; and (¥ \ A) A (£ \ B)= AAB, 


d—1 d—1 d—1 
Qu AP, = (U Q;) A (U P;) c (Q; AP) 
g=1 j=1 g=1 


yields (Qa A P4) < (d — 1)(d(d — 1) + 1) 6, whence the result follows by 
choosing ô = (d—1)~!(d(d—1)+1)7? 


The volumes ju(P;) =: p(i) of the atoms of any partition P provide a 
discrete probability measure up := {u(P;) ier» on P. While atoms in general 
change under the dynamics T, 


P, = TWP) := {r€ X: Tie P} VWj>0, (2.37) 


their volumes do not for T is assumed to preserve p. 
Further, if P;, N P;, = 0, then T~/(P;,) NT~4(P;,) = 0. Therefore, for all 
j EN (j €Z if T has a measurable inverse) 
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PI := TI (P) = {TI (P;) hier (2.38) 


are partitions with the same probability distribution of P: yp; = up. Further, 
partitions at successive times are all refined by the partition 


n—l1 


P™ := (Y PE =P V THP) V- V T(P). (2.39) 
j=0 


If p := card(Ip), the atoms 


Pe Pe (hie Oe) (2.40) 


a 


of P\) are labeled by strings i!” := igi, ---in—1 € R. We shall denote by 
ey {pa = HP) reams l (2.41) 


the probability distribution associated with P™ and consisting of the vol- 
umes of its atoms with respect to the given probability measure p. 


Remark 2.2.1. Notice that a phase-point x € X belongs to the atom P,(n) 
of P™ if and only if Tz € P; for all 0 < j < n—1. As a conse- 
quence, the atoms of P‘) contain all phase-points x € X whose trajectories 
{T!x}jez successively intercept the atoms P;, of P identified by the string 
i) = igi ein € a, As an effect of the coarse-graining, segments of dif- 
ferent trajectories {T! tn may correspond to a same i”) € Qa”. thus, as 
normalized volumes, the probabilities p (4) quantify how likely is it that 
different initial conditions give rise to a same segment of trajectory between 
(discrete) time j = 0 and j = n— 1. 


Lemma 2.2.1. Given a reversible dynamical system (X,T, p) and a partition 
P = {Pitietp, card(Ip) = p, the dynamics T : X + Æ corresponds to the 
left-shift (2.23) on sequences i € Q% (see Remark 2.1.4). 


Proof: Let i(x) € 27 be the sequence of atom labels corresponding to the 
trajectory {T} (x)}jez with initial point x € Æ. According to section 2.1.1 
and to (2.37), i;(«) = i; if and only if Tix € P; . Then, from (2.23), 


Therefore, any coarse-graining of ¥ by means of a partition P provides 
a description of the dynamical triplet (¥,T, u) in terms of the left shift 
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on the sequences in es The segments of trajectories up to time n — 1 are 


in one-to-one correspondence with the sets of strings i™ € RÇ? and the 
probability distribution us ) provides states over the cylinder sets C™”-1, By 
means of (2.40) and of the T-invariance of u, one shows that conditions (2.29) 
and (2.30) are fulfilled, whence the local states po define a global shift- 
invariant state up over o7. By varying x € 4, the trajectories {TI z} èz 
gets in general encoded by a subset Oo Cc O 


Definition 2.2.2 (Symbolic Models). Given a partition P of X, the triplet 
(25, gs up) provides a symbolic model for the dynamical system (X,T, p). 


Example 2.2.2 (Baker Map). The Baker map (see Figure 2.1) is the in- 
vertible map of the two-dimensional torus T? = {æ = (x1,22) , mod 1} into 
itself given by 


i 1 
(22,2) 0<zriı< -= 
Tgg = o 1 : 
(20 i =) 3 <xı<l 
; 1 
(222) 0< g< 
Ts!a#= 2 2 
ESD BEN a |. Ey ed 
2 i] 2 2 2 
1 1 1 
Tz" Tp i 
— — 2 
0 4 1 0 3 1 0 1 
| 75" |7» 
1 1 
1 
4 
0% 1 0 1 


Fig. 2.1. Baker Map 
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The map Tg is measurable with respect to the Borel o-algebra of T? 
and preserves the Lebesgue measure du(x) = dx daz: altogether, one has a 
dynamical system described by the measure-theoretic triplet (T?, Tg, dz ). 
It is evident that, when n — +o0, a sufficiently small distance between 
two points æ and a + ô increases as 2” along the horizontal direction until 
it gets of order 1. Therefore, Definition 2.1.2 gives log2 > 0 as maximal 
Lyapounov exponent of the Baker map; instead, small distances decrease 
exponentially along the vertical direction with the same speed so that volumes 
are conserved. 

Let wi(t1) = {wi}iso and w-(x2) = {w_j;}j>1 be the half-sequences 
consisting of the coefficients of the binary expansions of x1, respectively x2: 


D Da 
x = a Hi = ar a $ 
1 Qi+1 ? 2 2j 


j=0 j=1 


Setting w(x) := (w_(x1),w4(x2)) = {w;(x£)}jez E Rə and using the mod 1 
folding condition defining T?, it turns out that Tg is isomorphic to the left 
shift Tọ on (22, namely w;(Tgæ) = wj41(@). 

Further, the Lebesgue measure on T? corresponds to the uniform product 
measure (2.31) on the o-algebra generated by cylinders. This can be seen 
as follows. According to Remark 2.25, cylinders are intersections of simple 


cylinders as on and gis that correspond to the vertical rectangles Py = 
{æ : 0 < a < 1/2} and P, = {æ : 1/2 < a < 1} and their images 
ci = TiC), j € Z (see (2.26)). Under Tz’ they get rotated into 
horizontal rectangles; successive applications of the Baker’s map split them 
into horizontal rectangles of half height, each one of them having as neighbors 
halved rectangles coming from the other initial rectangle. 

It turns out that co, is a horizontal rectangle of width 1 and height 

car- 


40 ,21,+--,2n-1 
+1 whose 


2-"+!. a further intersection with gw provides the cylinder 
corresponding to a horizontal rectangle of width 1/2 and height 27 
area is 27”. These areas may only come from a product measure, 


n—-1 
n n— j j 1 . 
up OR D= [peP peP W. 
j=0 


Therefore, the coarse-graining of T? given by P = {Po, Pı} provides the 
symbolic model (22, Ty, up) for (T?, Tg, dz ). 


2.2.1 Algebraic Formulations 


In this section, instead of referring to phase-space trajectories, we shall con- 
sider classical dynamical systems from the point of view of their observables 
and of their time-evolution. By observables we mean suitable functions over 
the phase-space. 
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It is convenient to consider complex-valued functions f : Æ > C; their 
values f(x) can be inferred by measuring real, R(f), and imaginary parts, 
S(f). Further, it is reasonable to assume that functions f,g in a suitably 
chosen class of observables give observables in the same class under addition, 
(f,9) = (f+ 9)(«) = f(x) +9(«), and multiplication either by scalars a € C, 
(a, f) + (af)(x)af (x), or by another observable, (f,g) — fg(x) = f(x)g(2). 

In other words, it is a reasonable physical assumption to require that ob- 
servables constitute algebras of functions on Æ: these algebras are commuta- 
tive for fg = gf. Physically speaking, there are no fundamental obstructions 
to the fact that classical measuring processes can, in line of principle, be per- 
formed without effects on the state of the measured system. As the measured 
values depend on the system state, it follows that measuring g and then f 
yields the same results as measuring f and then g. 

Also, it is practically convenient to approximate certain observables in 
the algebra by means of other observables that are in a certain sense close to 
them; we shall thus assume these commutative algebras of observables to be 
endowed with topologies and to be closed with respect to them; in particu- 
lar, we shall consider algebras of observables where converging sequences of 
functions do converge to observables in the algebra. 

Like in Examples 2.1.2, 2.1.3, in the following we shall assume ¥ to be 
compact in a metric topology and measurable with respect to the Borel o- 
algebra that contains all its open and closed sets. Then, a natural algebra of 
observables is provided by the continuous functions on Æ [258]. 


Definition 2.2.3. Let X be a compact metric space; C(X) will denote the 
Banach *-algebra (with identity) of continuous functions f : ¥ ++ C endowed 
with the uniform topology given by the norm 


C(*) > f = fll = sup{|f (x)| : x € X}. (2.42) 


Remarks 2.2.2. 


1. If f,g € C(X) and a € C then f +ag € C(X) as well as fg € C(AX). 
Sums, multiplications by complex scalars, by continuous functions and 
complex conjugation * : f(x) +> f*(x) all map C(¥) into itself. These 
facts make C(¥) a *-algebra with a norm f +> || fll; indeed, 


Ifl=Oef=0, lafl = lall Al f+ gll < F+ lll, 
for all f,g € C(&), a € C. This norm defines the uniform neighborhoods 
U(f) := {g9 E C8) : |f -gll <e}, fec), (2.43) 


and equips C(4’) with a metric and a corresponding topology called uni- 
form topology, Ta. 
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2. A sequence {fn}nen C C(A’) is a Cauchy sequence if, for any € > 0 
there exists N € N such that nm > N => ||/fn — fm|| < £; since 
all Cauchy sequences in C(4’) converge uniformly to f € C(%), that is 
limn || f — fn|| = 0 or lim, fn = f, C(X) is termed a Banach algebra. 
Also, [lf*/ll = IIfIl2, f*ll = IIfil and [loll < Ill llgl for all f, g € C(2); 
this makes C(¥) a C* algebra (see Definition 5.2.1). 

Because of assumed compactness of ¥, the identity function I(x) = 1 
belongs to C(4’). When 4 is not compact, one considers the *-algebra 
Co(¥) consisting of the complex continuous functions on 4 vanishing at 
infinity. When equipped with the norm (2.42), Co(#) is a Banach algebra, 
but the identity function does not belong to it. 


93 


A description of dynamical systems by means of continuous functions 
is, however, too restrictive, in general. For instance, the corresponding C* 
algebras cannot contain observables related to yes/no questions like 


is the state localized within a measurable subset (region) A € ¥ or not? 


as these correspond to characteristic functions 14 of A which are only mea- 
surable and not continuous. The Koopman-von Neumann formulation of Ex- 
ample 2.1.1 offers a natural way to enlarge the algebra of observables. In a 
quantum-like notation, we shall denote by |% ) any function in L? (¥) and by 
(«| 4) its value y(x) at x € X. Functions f € C(4’) can then be represented 
on L2 (¥) as multiplication operators My: 


(z| Ms lw) = fpl), Vy E LZ) . (2.44) 


In the following, we shall identify, C(4) and its representation by multipli- 
cation operators, that is we shall identify My and f. 


Remarks 2.2.3. 


1. The maps C(¥) 3 f => Ly(f) := || fl w)|| are semi-norms. They define 
strong-neighborhoods on C(4), that is neighborhoods in the so called 
strong topology, Ts, 


USHA = {ge C8) Mf -g)l¥sdll Se, Sjn}. (245) 


Since ||(f — goly < If — gille 9 € Uyf) = g E€ UP; 
therefore, every strong-neighborhood contains a uniform neighborhood 


and is thus a uniform neighborhood itself; in general, however, there 
can be uniform neighborhoods which are not strong-neighborhoods, so 
that the uniform topology is finer than the strong topology, Ts < Tu; 
namely, J, has more neighborhoods. Practically speaking, a sequence 
fn € C(X) converges strongly to f € C(&), s — lim, fn = f, if 
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limn—co |I(f — fal Y) =0 Vy € L? (X), and, while all uniformly con- 
vergent sequences converge strongly, there can be strongly converging 
sequences which do not converge uniformly. 


. If {fn}nen converges with respect to T,,, it converges also with respect 


to Ts, but not vice versa; it follows that the strong closure of C(%), 
that is C(4) together with all its possible strong limit points, is strictly 
larger than C(’). Indeed, it contains C(4), simple functions and discon- 
tinuous functions f that may jump arbitrarily but only on sets of zero 
measure [258]. Equip ¥ with a o-algebra and a measure pu; then, 


lfl = inf{a > 0: w({e: If) > a}) =O} , 


where f is a measurable function on ¥, defines a norm ||- ||. called 
essential norm. If || fll. < œ, then |f(x)| > ||f||.. only on a set of zero 
measure; further, the following collection of measurable functions, 


L(#) = {f: IIflloo < oo} , 


is a C* algebra with respect to the essential norm known as the algebra 
of essentially bounded functions. 


. There is another topology on C(4) which is inherited by its multiplicative 


action on L? (X ) and which is coarser than the strong topology, namely 
the weak topology, Toe < T; < Ta. It is generated by the semi-norms 
Lol f) = K o| Mp |y )| which defines the weak neighborhoods 
PMH = {9 € O(X) T E 155 Sm} . 

(2.46) 
A sequence fn E€ C(*#) converges weakly to f € C(¥), w—lim fn = f, if 
and only if Tim KoI- fal )| = 0 for all Y, ġ € L? (X). As we shall see 
in the more general non-commutative context, the strong and the weak 
closures coincide. In the case of C(4’) they give rise to L? (X) which has 
the structure of a so-called von Neumann algebra. 


. Actually, L (X) can be generated as the strong closure on L? (X) of the 


algebra containing the characteristic functions of finer and finer partitions 
of X. More precisely, one may consider a refining sequence {Ph }n>0, 
Pn < Pn+i, that generates the o-algebra of X when n — +00. Each Ph 
is a finite dimensional commutative algebra A, whose elements are the 
step functions that are linear combinations of the characteristic functions 
of the finitely many atoms of Pn; then, 


weak—closure 
Lie (X) = An 


n 
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In order to complete the formulation of measure-theoretic triplets (4, T, u) 
into an algebraic framework, one has to endow C(#) with a time-evolution 
corresponding to T and a map C(¥) + C that play the role of u by assigning 
mean values to continuous functions. 

We shall consider invertible continuous dynamical maps T on æ¥ and 
discrete-time dynamics. As in Example 2.1.2, any f € C(A’) changes in time 
according to 


f(a) > f(T'x) = f o T(x) =: fila), teZ. 


The map Or : C(X¥) + C(X), defined by Or(f) = foT is an automorphism 
of C(#); namely, it is invertible and 


Orla f + Bg) =aOr(f)+8Orlg), Or(fg) = Or(f)Or(g). (2.47) 


Moreover, Or preserves the uniform norm. 


Example 2.2.3. In the Koopman-von Neumann formalism where functions 
f € C(&X) are represented as multiplication operators, the Koopman operator 
Ur implements unitarily the automorphism Or; indeed, using (2.4), for all 
Y EL?(X)andzE X, 


(a|Ur f U$) = f(Tx) (Tx|Upy) = (Tx) (T oTe| 4) 
= («| Or(f)y). 


Notice that Ur cannot belong to C(%), otherwise it would commute with all 
f € C(&) which would then be constant in time. 


Concerning the possible states over C(¥) (see (2.2)), we shall consider 
the space M(%) of regular Borel measures over V (see Remark 2.1.1.5). 
The simplest instances of elements of M(&) are the evaluation functionals 
bx : C(X) + C, defined by 6,(f) := f(x), for all x € X and f € C(#). These 
functionals can be seen as integration with respect to Dirac delta distribu- 
tions and embody the fact that phase-space points are the simplest physical 


states: 6,(f) = J dy f(y) 6(y—). By making convex combinations of eval- 


uation functionals one obtains more general positive, normalized expectation 
functionals over C(%). 

Actually, a theorem of Riesz [258] asserts that the action of any such 
functional is representable by integration with respect to a regular Borel 
measure in M(X). In view of the physical interpretation of states as positive 
functionals that assign mean values to observables, it makes sense to identify 
measures u € M(X) and states w, : C(¥V) > C such that ? 


3For sake of notational convenience, we shall sometime employ the notation 


u(f) for wy (f). 
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A3 fru(f)= i. du(z) f(a), Vreo. (2.48) 


Remarks 2.2.4. 


1. With two measures 41,2 on a measure space ¥ all convex combinations 
pti +(1—p) 2 with p € [0,1] provide other measures; therefore, the space 
of states of classical systems is a convex set. 

. Given two measures 41,2 on XY equipped with a o-algebra X, ju; is said 
to be absolutely continuous with respect to u2, p1 < H2, if for any B € X 
H2(B) = 0 => p(B) = 0. Then, there exists a positive f € L/(4) 
such that yi(B) = f,due(x) f(x) for all B € X. The density f(x) is 


called Radon-Nikodym derivative and denoted by ya. If also u2 < pı 


then jt; and uo are said to be equivalent. Differently, yı and uo are called 
mutually singular, pı L po, if there exists B € X such that uı(B) = 0 
while fi2(V\B) = 0. 

. According to Lebesgue decomposition theorem, given two measures u and 
m on X, there exists a unique choice of measures 41,2 and of p € [0,1] 
such that u = pi + (1 — p)u2 with py < m and p2 L m. 

4. If ¥ = T? as in Example 2.1.3, then any Li, (T?) > p(r) > 0 with 
Jq2 Ar p(r) = 1 is the Radon-Nikodym derivative of a measure which is 
absolutely continuous with respect to dr. Vice versa, evaluation func- 
tionals 6,(f) = f(r) are singular measures with respect to dr. 


N 


ww 


Finally, a measure u € M(X) is T-invariant if the corresponding mean 
values are time-independent, w,(Or(f)) = w(f) or wu = w0 Or. Notice that 
(2.48) and (2.47) allows one to extend the state w, and the automorphism 
Or to the von Neumann algebra of essentially bounded functions L% (1). 


Definition 2.2.4. To any measure-theoretic triplet (¥,T,), where X is a 
compact metric space equipped with the Borel o-algebra and p E€ M(X) is a 
T-invariant regular Borel measure, one can associate a C* algebraic triplet 
(C(X),Or,w,) and a von Neumann triplet (LX (X), Or, wy) where state w, 
and automorphism Or are defined as in (2.48), respectively (2.47). 


2.2.2 Conditional Probabilities and Expectations 


Given a measure space (X, p) with o-algebra X, a finite partition P = {P;}?_, 
such that u(P;) > 0 for all i, and X € X, consider the following function 
XAR) . 
z E€ X = u(X|P)(zx) := LERA if ceP,, (2.49) 
WEG 


such that f, du (x) u(X|P)(x) = u(X A P;). (2.50) 
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It is the conditional probability of X € X given the partition P and repre- 
sents the probability of the subset X once it is known that x belongs to one 
of the atoms of P. This notion can be extended to the case of partitions with 
atoms P such that u(P) = 0 by assigning a same fixed, arbitrary real value 
to u(X|P)(x) when a € P: in such a way one gets a family of versions of 
the conditional probability each of which satisfies (2.50) [61]. One can ex- 
tend (2.49) and (2.50) and define probability distributions conditioned upon 
o-subalgebras T C X. 

Consider an integrable function f € L} (¥), the functional on 7 defined 


by F(T) = fa (x) f(x), T € T, is bounded, o-additive and absolutely 


continuous with respect to u (see Remarks 2.1.1.3 and 2.2.4.1); its Radon- 


Nikodym derivative a (x) =: E(f|T)(x) such that 


[ ewe) E(f|T)(x )= f aes (ct) VIET, (2.51) 


is J-measurable and integrable and is called the conditional expectation of f 
with respect to the o-algebra T. 

By choosing as f the characteristic function lx of a subset X € X its 
conditional probability given T is thus defined by u(X|T)(x) := E(1x|T) (x) 
and is such that 


[we w(X|T)(z) = w(XNT) YXES, TET. (2.52) 


Given a o-subalgebra 7 C X consider the Abelian von Neumann al- 
gebra L% (&¥,T) consisting of the essentially bounded T-measurable func- 
tions on XY (see Remark 2.2.3.2). This is a subalgebra of the Abelian von 
Neumann algebra L (X) of the X-measurable essentially bounded func- 
tions on ¥. Then, (2.51) makes the conditional expectation a linear map 
E(.|T) : LY(X) = LX(X,T) which is linear, positive and a measure pre- 
serving projection, that is zo E(-|T) = u and E(E(f|T)|T) = E(f|T). The 
first three assertions are evident, while idempotency is a corollary of the fol- 
lowing more general property. Suppose 71 < 72 are two o-subalgebras of 
X such that Ti € T => Tı € TQ but not vice versa, in general. Then, if 
f E€ LX(¥), (2.51) yields 


du (2) E(E(f|%)|T) = L du (2) EIT) = | du(e) fle) 


Tı Tı 


da (x) E(f|Ti) , 


Ti 


for all Ti € F4, whence T, < Tz => E(E(f|T2)|Ti) = E(f|T). 
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Proposition 2.2.1. If f € LY (¥) and g E LY(X,T), where T C X, then 
E(g f|T) =g E(fIT) . (2.53) 


Proof: Suppose g = 1r, the characteristic function of a subset T € T; 
then, for all To € 7, T O Tọ € T and (2.51) yields 


du (2) EAr FIT) = | aa) ta) = f 1p(x) E(f|T)(z) . 
To TNT To 


Then, one concludes the proof by approximating g € LX (X, T) with respect 
to the essential norm by means of simple functions. 


Given a refining sequence {T,,}nez, that isn < m Ta C Tr KE 
we shall set 74} := V„ez the smallest o-subalgebra containing all the T,,’s 
(Ta T T4), respectively denote by T- := Anez Zn the largest o-subalgebra 
contained in all the Zn (J, | T—). The proof of the following continuity 
properties can be found in [61] and [101]. 


Theorem 2.2.1. Let X be a measure space equipped with a o-algebra X and 
a measure u; given a refining sequence of o-subalgebras Tn, then 


im EIR) =E), lim E(fT) = BIT) , 


for all f € LẸ (X). 


Examples 2.2.4. 


1. f X DT := N, the trivial o-algebra consisting of the empty set Ø and 
the whole of ¥; then, E(f|M) = (f) 1. On the other hand, if T = X, 
then E(f|X) = f. 

2. By inserting characteristic functions f = 1x, X € X, in the above theo- 
rem, one gets the following continuity properties of the conditional prob- 
abilities: 


„ER M(X|In)(@) = u(X|T+)(2), u= a.e. 


for allX-measurable subsets of X. 

3. Consider the unit interval [0,1) with the Borel o-algebra X and the 
Lebesgue measure du (x) = da; construct the measure algebra 7p gener- 
ated by the partition P, of [0, 1) into 2” atoms Py = [k27”, (k +1)27”). 
Then, 7, | X and [61] 

2—1 (k+1)27® 
EGITE = X tr.a)?" yar f dt f(t) , 


g-n 


for all f € Lipy) (dt). For n — oo the summand containing z tends to the 
derivative of the integral at x and thus to f(x) p-a.e.. 
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2.2.3 Dynamical Shifts and Classical Spin Chains 


Dynamical shifts and symbolic models can be given an algebraic formulation 
in terms of classical spin chains. Consider a triplet (07; Ts, 4), that is a shift- 
dynamical system over doubly infinite sequences of symbols from an alphabet 
with p elements that leaves invariant a measure p. 

Let us associate to each symbol j € {1,2,...,p} a p x p matrix of the 
form 


0 0 0 vee 0 0 
0 0 0 vee 0 0 
WY 
(j,j) -thentry 

0 0 0 vee 0 0 
Varying 1 < j < p, we obtain an orthonormal family of orthogonal projec- 
tions such that P;P; = ôi; P; and De Pj = lp, where 1p denotes the p x p 
identity matrix; these projectors generate the diagonal p x p matrix algebra 
D,(C) 4 with elements 


d 0 0 0 0 
0 d 0 0 0 5 
D=| 0 0 O I Sgr. (2.54) 
. . . eee e * j= 
0 0 0 22s 0 


To each label j in a sequence 7 = {i;}j;ez one thus associates the diagonal 
matrix algebra D,(C): each of its minimal projectors thus corresponds to a 


simple cylinder. Extending the construction to generic cylinders Glia e A 
as in (2.24), these correspond to tensor products of projectors 
POr -Qr , 4 = ioi, in. (2.55) 


Then, the natural matricial description that one associates to strings of 
length n is the diagonal matrix algebra D™) = D8” := Qro (Dp(C));, 
namely the tensor product of n copies of D(C) whose elements are diagonal 
p” x p” matrices of the form 


Des. y wee (2.56) 
imen 


“These commutative matrix algebras are also called Abelian and projections as 
the P; are known as minimal projectors. 
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A suggestive physical picture is as follows: each matrix algebra D,(C) 
describes a classical spin, with p possible states, in an infinite classical fer- 
romagnet. Spins located at the lattice sites =n < j < n are described by 
tensor products of the form D[—n.n] >= @j=—n(Dp(C));. These matrix alge- 
bras can be interpreted as algebras of observables for finite portions of the 
infinite ferromagnet by the embedding Di-n,n) > 1-n-1] 8 Di-n,n] 8 Uns 
into Dæ := Unso Pi_n,n}, where 1_,_1 and 1,4; denote the tensor products 
of infinitely many identity matrices 1 € D,(C) located along the two-sided 
chain at sites from —oo up to —n — 1, respectively from n +1 up to +00. 

Each Di-n,n] can be equipped with the standard sup-norm of matrix 


algebras (see (5.3)) °. The sup-norm inductively extends to D and allows 
—— uniform 
to consider the uniform closure Dz := Unen Pi-n,n] . This procedure 


is known as C* -inductive limit [64] as it involves an increasing sequence of 
local algebras Di_n nj; Dz provides a C* algebraic description of a classical 
spin chain. 

Using (2.26), the left-shift along sequences gives rise to an algebraic shift 
map Os : Dz > Dz such that 


6.) aa) = Paty + (2.57) 


Further, the local probability measures pi := {p(i™ eet that 


yield the global T,-invariant state u over OF can be associated with diagonal 
matrices anal 
p= YS. pe yey Y, (2.58) 
ME Q™ 


by means of the trace operation (see (5.19)) which acting on any matrix re- 
turns the sum of its diagonal entries. In fact, multiplying oe” with matrices as 


in (2.56), gives another matrix in D™) with diagonal elements p(é')) d(i™), 
and one gets 
Tr(of) D™ ) = Spay ae) , 
ime 
whence p(i™) = Ta Bey. Therefore, conditions (2.29)- (2.30) 


translate into the following algebraic relations to be satisfied by { Oe} nen: 
Trn (ol) = Tri (P) = pr , (2.59) 


where Tr; denotes the trace with respect to j-th factor. These conditions 
allows to consistently define a global state w, on the spin chain Dz; this state 


°The sup-norm of a diagonal matrix D is the square root of the largest diagonal 
element of DÝ D. 
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is specified by its values as a positive expectation functional over local spin 
arrays where it coincides with the local states ow (which we shall encounter 


in the quantum setting as density matrices). 


Definition 2.2.5 (Classical Spin Chains). 
The C* algebraic triplet (Dz,Q>c,w,) associated with a measure-theoretic 
triplet (2p, To, p) will be referred to as a classical spin chain. 


Remark 2.2.5. In Section 5.3.2, it will be proved that to all classical spin 
chains as defined above, there correspond algebraic triplets as in Defini- 
tion 2.2.4. In particular, the von Neumann algebraic triplets arise when the 
C* triplets (Dz,O,,w,,) are represented on a Hilbert space and enlarged by 
adding to them their weak-limit points. 


Example 2.2.5. [113] Consider a Markov chain as in Example 2.1.5. Let 


p1) 0 0 

d 0 2 0 
Pore | p Po 
= 0 0 = pd) 


correspond to the probability measure u = {p(i)}4_,. Define on the tensor 
product Da(C) @ Da(C) a linear map E : Da(C) ® Da(C) > Da(C) by linear 
extension of the following action on tensor products of minimal projectors 
P; € Da(C) 


? 


P; 8 P; > LLP; ® P;] = p(jli) P; , 


where P(j|i) are the transition probabilities of the Markov chain. From (2.34) 
and (2.35) and using that 5 Py = 1, 


d 
[1 @ 1] = » IIP; ® Pj] = > p(jli) P; = >P =1 


Te(p [1 @ Pal) =>" Tr(p |P; & Pal) = 3 p(kli) Tr(p Pi) 


ei 
Il 
un 


for all k = 1,2,...,d. Furthermore, higher order probabilities as in (2.36) are 
iteratively obtained as: 


plioii ina) = Tr(pEIP, @E[P, -+ EIP, @ 1]]]) . 
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For instance, to evaluate Tr(p ULP: Q E[Pi, ® 1) use De P; = 1, then 


mp LPi Q E[Pi, ® Pr] ) z Y Pikti) Te (p [Pig @ Pil) 
k=1 


= Tr(pE[Pi, ® Pa]) = plialio) Telo Pio) 

= p(ii|to) plio) = pliot1) - 
Therefore, the local density matrices oe” are the local restrictions w, MD) of 
a global state w,, on the classical spin chain Dz such that, for all D; € Da(C), 


wy (D1 9 Dz ®@ ++ Dn) =Tr(p [D @E[D2 @--- [Dn ® 111) , 


2.3 Ergodicity and Mixing 


The two uncoupled harmonic oscillators of Example 2.1.2 whose orbits fill 
the phase-space densely (see Remarks 2.1.2.2 and 2.1.2.3) are typical ergodic 
systems. Ergodic theory developed [167] from the attempt to explain why in 
thermodynamic systems time-averages of typical observables coincide with 
their mean values (f) with respect to equilibrium distributions. Intuitively, 
if an orbit fills the energy shell densely, evaluating the time-average of a 
function along such an orbit should indeed amount to integrating with respect 
to the Liouville measure restricted to the energy shell. 


Definition 2.3.1. Let f : X — C be a complex function associated to a 
dynamical system (X,T, u); time-averages are defined by 


in discrete, respectively continuous time. 


Example 2.3.1. Consider the uncoupled oscillators of Example 2.1.2 and 
a continuous function f : T? + R. By means of (2.12), the discrete and 
continuous time averages yield 


eit wey wini _ | 


(et Dia Wini — 1) 


en()= X Fn)enlO), 


neEz2 
Day winjE2nZ 


respectively 
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= a ett i wini _ | = 
é)= li ——,——— ¢n (9) = n(0). 
aA ttoo 2 ie) it Ia Wini a7 2 LN) 


UPL, wing =O 


Then, besides ensuring that orbits fill T? densely, the conditions in Re- 
mark 2.1.2.2 and 2.1. R i also imply ae time-averages coincide with their 


mean values: f(@) = = fyz d0 (6) = u( f). 


A considerable break-through in ergodic theory was Birkhoff’s theorem ®. 


Theorem 2.3.1. Given (2,7, n); let f€ Li,(&) be a complex p-summable 
function on X. Then, 


1. the time-average f(x) exists u-a.e. on X; 
2. the time-average f is T-invariant: foT = f p-a.e.; 
3. the time-average f € L} (X) and u(f) = u( f). 


The proof [91, 61] of these important results hinges on the following lemma 
known as maximal ergodic theorem. 


Lemma 2.3.1. Given the dynamical triplet (x, 7, n), for any f € Li (X), 
1 
set S$ (x) = LS* f(Tir) and Af := fz EX: supa S$ (x) > o}; then, 
j=0 


dy (x) f(x) 2 0. 


Af 


Proof: Set (x) := max{ 0, Sf(a),...,S£(a )} and split ¥ into the sub- 


set Af := 2 @ (x) > 0 and its complement where bY (a ) = 0. Further, 
p2 (x) := max{ $ f(x),..., Sf (x b= & (x) on Af; also, 
BO (a) = max{ f(a), f(x) + ITa) Ae) EATE) HTa) 


(a) + max{ 0, f(Tx),...,f(Tx) ++: f(T"2)} 
(x) + OY) (Tx) . 


Thus, since BY) (x) is non-negative, js is T-invariant and 6°) (x) > B°) (x), 


°Though the results presented below can be extended to dynamical systems in 
continuous time, we shall concentrate on discrete time dynamical systems. 
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Ml) Fe) | du (e) (kale) - aa) 
>f du (x) B® (x ef du (2) 6 (Tx) 


- f as OO (x )- f uoa (x) =0. 


Then, the result follows for, when n — +00, the points in Ay that are not in 
Af form a set of vanishingly small measure p. 


Proof of Theorem 2.3.1 Let a < b € R and, using the notations of the 
previous lemma, set 


Egy = {ae : lim inf — Gash a si(a yh. 


n—+0oo n n—+oo 


Let g(a) := f(x) —b when x € Fab, otherwise g% » (a ) = 0; consider the set 
1 g® 
{vex : sup —Sh"" (x) > 0} = {ae x : Sup z l si(a ) >b} . 
n n 


This set not only coincides with the set AI as defined in Lemma 2.3.1, but 
it also equals Fab; while the first property is contained in the definition of 
the set, the second one follows from the fact that, on one hand, 


1 1 
limsup -Sf (z) >b —> sup =Sf(x£) >b = Ea C AI, 
n n 


n— +00 n—-+00 


On the other hand, by definition of Egy, if x ¢ Eq, also T” x ¢ Ea for all n, 
a) 
whence g (Tx) = 0, Sr (x) =O and z ¢ Ada , Thus, Lemma 2.3.1 yields 


1 
J a) du (x) g\ (a) =f dp (x =o) > 
Afab Eav 


The same argument applied to the function gÊ (a := a — f(x) if x € Ea, 
(2) 


) 
otherwise g, (x) = 0, gives J du (x) (a — f(x)) > 0, whence 
Eav 


bu(Ea) < ie du (a) f(t) < au(Ea) . 


Since a < b this can only be possible if u(Eab) = 0; therefore, the limit 

a 1 

f(x) := lim -Sf (x) exists p-a.e. on XY. Namely, outside the union of all 
n—+oo n 


Ea» with rational a,b, which is still a set of zero measure, the sequence S$ (x) 
converges pointwise to f(x) which can however be too. 
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The limit function is T-invariant by construction; moreover, f € LI (X). 


Indeed, 
fuwl f uwe 


therefore, Fatou’s lemma 7 yields 


f du (x) | f(x) < timinf f an du (a ) ls 
x n—+0o0 


1 
Finally, choose A > 0, let g)(a) := Este) 
n 


f(a) < f antao; 


— à, and consider the set A” as 


in Lemma 2.3.1 ; then, 


[ewe | sie) — f(a)| < 
x 


f gn wO 


1 
i a s (2) 
AIX n 


[os 
Consider the third integral, Lemma 2.3.1 yields u(A%) < ` || fll1; as to the 
second one, it can be estimated as follows 


1 n—1 
<i Tx a (A 
SD fop MEDI + 0H) 


1 
aule) [8h 
k=0 


AIX 
: / ea POIO tana” , 


for some fixed a > 0. Now, u( A>) and thus the third integral can be made 
arbitrarily small by choosing an appropriate A, as well as the second one by 
setting a large enough. Further, Lebesgue dominated convergence theorem ®, 


1 Ks 
can be applied to f du (x) ste) — fe) which becomes negligibly 
X\ AIX n 
small when n — +00, whence 


yet 


fave) Ha) = tim, faa) 2842) = tim, Ef WOTA 
= f ane) Ae) 
x 


"Fatou’s lemma [258] asserts that if fn is a sequence of measurable functions on 
a measure space X, then du liminf fn < lim int f du fn. 
x n—+00 n—+00 


8 Lebesgue Dominated Convergence Theorem [258] asserts that if {fn}nen is 
a sequence of measurable functions on ¥ such that the limit f(x) = lim fa(£) 


exists for all « € ¥ and |fa(£)| < g(x) for all x € ¥ with g € LI(¥), then 
FELLY) and | dyu(x) f(a) = lim f au (2) fale) 
x x 


n—-+0o 
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In the light of Birkhoff’s ergodic theorem, we first give a general measure- 
theoretic definition of ergodicity and then consider its physical consequences. 


Definition 2.3.2. A dynamical system (X,T,) is ergodic if for all measur- 
able subsets T~'(B) = B implies u(B) =0 or p(B) =1. 


Remarks 2.3.1. 


1. The first conclusion to be drawn from this definition is that ergodic sys- 
tems cannot possess non-trivial T-invariant measurable functions (con- 
stants of the motion). Indeed, if f : Æ — R is such that f oT = f then 
Na := {x E X : f(x) =a} C X is measurable; moreover, as x € T~1(Na) 
implies Tx € Na, then f(x) = f(T'x) = a. Thus, T~1(N,) C Na and er- 
godicity forces Na to equal either ¥ or Ọ p-a.e. for all a € R, whence 
f(x) =cy prae. on ¥. 

2.1f fe Li(& ), its time-average f is T-invariant by point 2 in Birkhoff’s 
theorem. If (¥,T, u) is ergodic, from the previous remark and point 3 in 
Birkhoff’s theorem, f(x) = c7 u — a.e; thus, u(f) = c7 = f(z) u — a.e: 
on ¥. Namely, ergodicity implies that time-averages and phase-averages 
(mean-values) of (summable) observables coincide. Vice versa, dynami- 
cal systems where time-averages and phase-averages coincide are ergodic 
because of Proposition 2.3.1 below. 

3. If the only T-invariant measurable functions are constant almost every- 
where on Æ, then (¥,T, p) is ergodic: in fact, the characteristic functions 
of T-invariant measurable subsets are T-invariant and must then be con- 
stant almost everywhere, namely equal either to 0 or to 1 pi-a.e. 


4. The average time spent within B by almost all phase-points of an ergodic 
t-1 


= 1 
system equals the volume of B. Indeed, let T(z) = z9 1s(Tĵs); 


a= 
count the mean number of times B is crossed by the trajectory {T"x}nen 
during a span of time of length t. Then, ergodicity yields 


Ta(z) = lim Tp(£)=u(B) p—ae.. (2.60) 


t— +00 


Proposition 2.3.1. A dynamical system (X,T, u) is ergodic if and only if 
for all measurable A, B it holds that 


t— co 


lim : > u(ANT~8(B)) = u(A)u(B) . (2.61) 
3s=0) 
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= 1 
Proof: Consider 14(2)Tp (2) =n > Lanr-s(p)(x), by Birkhoff’s theorem 
and Lebesgue dominated convergence theorem (see footnote 8) it follows that 


t-1 


w(1alg) = lim = 7 MANT *(B)) . 


) follows from (2.60). If (2.61) holds, then 
B), whence u(B) equals either 0 or 1. 


If the aytem is ergodic, then (2.61 
A= B = T-!(B) = p(B) = u( 


Definition 2.3.3 (Mixing). A dynamical system (X,T, 4) is mixing if and 
only if for all measurable subsets A,B C X it holds that 


lim u(ANT™(B)) = u(A)u(B). (2.62) 


t—+00 


The subsets A N T~'(B) consist of those points of A that visit B at time 
t; thus, (2.62) asserts that relative to the volume of any measurable subset 
A, the volume of points of A that will eventually be in another measurable 
subset B equals the volume of B. In other words, mixing dynamical systems 
are in the long run characterized by the uniform spreading of their measurable 
subsets; on the other hand (2.61) states that ergodicity amounts to a uniform 
spreading on average. 

From a physical point of view, quantities like u( A N T~'(B)) are two-point 
correlation functions; thus, mixing characterizes dynamical systems whose 
two-point correlation functions factorize asymptotically, whereas ergodicity 
corresponds to two-point correlation functions factorizing in the mean. 


Remarks 2.3.2. 
n—l1 


1.If i n = Q, an € R, then, lim + = a, wh 2.62) im- 
ima a, An € en e a, whence ( ) im 


plies (2.61) and mixing implies ergodicity. The opposite is not in general 
true as the time-average can get rid of those s for which «(ANT *(B)) # 
u(A)u(B). There is a third asymptotic behavior, intermediate between 
ergodicity and mixing, known as weak mizing [313] and related to the 
fact that 


yen 
Jim, |an — a| = 0 => Jim a ~ lan -a= 0 
j=0 
1 n-1 
lim — n— =0. 
a alae a)| =0 
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Weak mixing amounts to the request that 


Jim FAN T-(B)—H(AB))|=0; (2.63) 
s=0 


it is implied by mixing and implies ergodicity. 

. Given an invertible map T, a stronger notion of mixing is formulated 
as follows [91]. Given any finite collection S, := {S;}'_,, Si € X, of 
measurable subsets of 4, denote by 


N 


the o-algebra generated by all possible atoms of the form T~*($;) for 
k > n and S; €S,. Then, (4,7, u) is said to be K-mizxing if 


im sup __|u(S N B) — (So) w(B)| = 0, (2.64) 
nO BET 2° (S,) 


for all So,S, € X. Observe that x € ©°(S,) implies Tg € S; at 
some time k > n for some atom S; E€ Sr; therefore, K-mixing amounts 
to the uniform statistical independence of any given measurable subset 
from the trajectories of any finite family of subsets if these are considered 
sufficiently far away in the past. 


By using the density of the algebra of simple functions G(4) in the 
Hilbert space L? (X) as in Example 2.1.1, it is convenient to reformulate (2.61) 
and (2.62) in terms of square-summable functions. It is thus possible to study 
how those properties constrain the spectrum of the Koopman operator Ur. 


Proposition 2.3.2. A dynamical system (&,T, u) is 
1. ergodic if and only if for all p, € L? (X) 


i 
lim 4) Hh Go T%) = pihalo) ; (2.65) 
s=0 
2. mixing if and only if for all ,o € L? (X) 


im (do T") = ulh)ulo) - (2.66) 


According to the Koopman-von Neumann formalism of Example 2.1.1, us- 
ing (2.5), it turns out that u(y o T*) = (y*|U5|¢). Therefore, ergodicity 
and mixing can conveniently be expressed as weak-limits, that is as limits with 
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respect to the weak topology (see Remark 2.2.3.3). Then (2.65) and (2.66) 
are equivalent to 


t-1 


_ 3 
A aaa (2.67) 
w— lim Ue =|) 4); (2.68) 
The constant function | 1) is such that Ur|1) = |1); if there exists 


|) # |1) such that Ur| Y) = |W), then one can orthogonally decompose 
|) =a|) + 6] ¢) with (| 1) =9, ||| = 1 and Ur| ¢) =|¢). Thus, 


1= lim (6|Up le) 4 ($ID)? =0, 


whence (2.68) cannot hold. If (2.68) holds, a similar argument excludes the 
presence of eigenvectors |Y, ) such that Ur| Yy ) = e*| Ya ). Therefore, 


Proposition 2.3.3. A dynamical system (X,T, p) is mixing only if 1 is the 
only eigenvalue of its Koopman operator and it is not degenerate. 


In order to see the impact of ergodicity as expressed by (2.67) on the 
spectrum of Ur, we use [313] 


Proposition 2.3.4 (von Neumann Ergodic Theorem). Let Ur be the 
unitary Koopman operator acting on the Hilbert space H := L? (X) ofa 
dynamical triplet (X,T, u), with T invertible. Let A, : H + H be defined 


t—1 
1 
by Ail a) := z) Ulv), w € H, and let P project onto the subspace K 


s=0 
of vectors such that Ur| Y) = |Y). Then, ’ lim (A: — P)¢|| = 0; in other 


words, P is the strong limit (see Remark 2.2.3.1) of the sequence of operators 
At, P=s— limy++400 Ar. 


Proof: The subspace orthogonal to K is (Ur — 1)H; thus, for any w € H, 


lY) = Plo) +- P) y) = Plo) + (Ur -1)lġ) , 
Uh- 1 


for some ¢ € H. Since A4+(Ur — 1) = , the result follows from 
Iz= DIA) < allel 


IA: = Ppl s SE ' 
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Corollary 2.3.1. A dynamical system (¥,T, u) is ergodic if and only if 1 is 
a non-degenerate eigenvalue of the Koopman operator Ur. 


Proof: Since strong convergence implies weak convergence, condition (2.67) 


means that ergodicity is equivalent to P = |1)( 1. 


Remarks 2.3.3. 


1. 


By substituting Y, € L2(4) with Y — u(y) and ¢ — p(¢@) (they also 
belong to L? (¥)), ergodicity, respectively mixing amount to 


7 
He Ae eor =); Him uweoT) =0, (2.69) 


for all 4%, € LZ (X) with u(y) = u(¢) = 0. 


. Incase T is not invertible, the Koopman operator is not unitary, but just 


an isometry, that is UU = 1, while UUt 1. If T is invertible, time 
averages can be extended from —co to +00 and, because of T-invariance 
of u, it does not matter which one of y and ¢ is the time-evolving function 
n (2.65) and (2.66). 


Examples 2.3.2. 


1. 


2 


Ergodic rotations as in Example 2.1.2 are never mixing for the Koopman 
operator has the exponential functions e,(@) as eisenTunctions) 


. The system in Example 2.1.3 is mixing. Given ~,¢ € L3,(T?) with 


p(w) = u(d) = 0, let £ > 0 and choose L > 0 such that ||q — well <eand 


lo — de|| < €, where pe = Di mi<r b(m)em and $: = yl <7 d(n)en 
Then, using (2.22), 


lul% bo TR)| < e (lell + lloll) + lulpe de 0 T?)| 
<e(ldll + lidell) + So (mn) |d(A-Pn)] . 


|r|] <SL 
|ATPn||<L 


When p — ov, hyperbolicity permits ||n|| < L and ||A~?n|| < L only if 
n = 0; since p(¢) = ¢(0) = 0, the Arnold Cat Map satisfies (2.66). 


. Conditions for the Markov chain in Example 2.1.5 to be mixing can be 


derived by considering two-point correlation functions involving cylinders, 
of the form 


[0,p—1] ca- 1] [0,p—1] [t,t-+q—1] 
is “tp— ah ae Cio. “Jaq- i= p(ch “tp— ol pone ‘Jq- 5 ` 


By means of the matrix P = [p(i|j)] of transition probabilities and 
of (2.36), one writes 
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a( Oeil n P) = > plio: ip—1kip > kt-1jo +: jq—1) 


leg cheat 
d 
= X ) (I Pa) Joke-1 X I Pregiky | X 
kp ke—1=1 a=0 kettis ke-a bp 
p=2 
x Pkpip—ı ( II Piat) p(io) 
c=0 
d 


z D (I Page jokt-1 (Pe | Ome Pkpip-ı p(io A -ip—1) i 
Oa 


kp,kt—-1=1 a=0 Ker ta) 
Using (2.34) and (2.35), it follows that (see [61, 313]) 
[0.p—1] [t.t+q-1]) _ [0,p—1] [0,4—1] 
Jm a(o: i 1 NC ) _ u( ch oh aie (ce ‘Jq— A y 


is achieved if and only if : lim (P*)i; = pli) for all j = 1,2,...,d; while 


factorization in the mean (and thus ergodicity) holds if and only if, for 
all j = 1,2,...,d, 


t—1 
ma l è 
Quy = tia | Pa = 2) 


4. The condition for mixing in the previous example is certainly satisfied by 
Bernoulli dynamical systems whose matrix of transition probabilities is 
P= [pli] j= (see Example 2.1.5). 


2.3.1 K-Systems 


Consider an invertible Bernoulli system (25, To, 14), where the space of p-adic 
doubly infinite sequences 2 € OF is equipped with the o-algebra generated 
by cylinder sets and p is a translation invariant product measure on X. Let 
Cio} = {eres be the finite partition consisting of simple cylinders as 
in (2.25) and consider the o-algebras 


C u= V T3’ (Cto) = V Cty (2.70) 
j>0 j>0 
Cn = T3 ey) = V Ca (2.71) 
j>-n 


generated by union and intersections of cylinders of the form (see (2.26)) 
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q 
Ces = N am , ee tptpp1 + tq E oe ’ 
J=P 
for any q > p > —n. From Section 2.1.1 and Examples 2.3.2.3-4, one deduces 
that: 


NC C Cno ti) V C=, aay NCN, (2.72) 


n>0 n>0 


where WN is the trivial o-algebra consisting only of the empty set @ and of 
Q7, all equalities being understood up to sets of zero measure. Condition ii) 
expresses the fact that cylinder sets Ci, 4) with p,q € Z generate X, while in 


condition tii), 
A cm = A V T Cto) 


n>0 n>0 j>n 


denotes the largest o-algebra, called tail of Cro} (Tail (Cto})) contained in all 
C-n] with n > 0. 

Cylinders in C_, are of the form one with p > n, q => 0; they become 
subsets of Tail (Cto) when t — +00. Then, from the mixing relation in 
Remark 2.3.2.3, one deduces that the characteristic functions of these atoms 
go into the constant functions pos) ll, asymptotically, whence condition 
iti). Bernoulli shifts are particular instances of Kolmogorov (K -)systems [91] 
and Coj a particular example of K’-partition. 


Definition 2.3.4 (Classical K-systems). A discrete-time dynamical sys- 
tem (X,T, u) with o-algebra X is a K-system if there exists a o-subalgebra 
(a so-called K-partition) Xo C X that gives rise to a nested K-sequence of 
o-subalgebras X; := T*(Xo) such that 


1. Di := T'(X) C Siyi for allt € Z; 
2. Viez Xr = a; 
3; Nez a4 = N. 


For Bernoulli shifts, the partition Cro} is such that V„ez Ts (Cto) = X 
and Tail (Cro) = N: Cio} is a generating partition with a trivial tail. 


Definition 2.3.5. Let (X¥,T, p) a measure theoretic dynamical triplet with X 
as o-algebra. 


1. A finite, measurable partition P of X is called generating if (apart from 
sets of zero measure p) 


+00 +00 
\/ T(P)=X (T invertible) or \/ T4(P) = (otherwise) . 
j=0 


j=—00 
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2. The tail of a finite measurable partition P is defined by 
Tail (P) := A V TP) (2.73) 
n>0 k>n 


and will be said to be trivial if Tail (P) = N, that is if all its subsets equal 
Ü or X up to sets of zero measure p. 


Remark 2.3.4. A generating partition P = {P; }iez consists of X-measurable 
atoms P; such that unions and intersections of their images T”(P;), in the 
past, n < 0, and in the future, n > 0, generate X. Instead, the refinements 


PLn] i= P T—*(P) are the o-subalgebras generated by the atoms in the 
past of P up to a discrete time t = —n. Since P_y_1) GC P-n], one can also 


loosely write Tail(P) = lim P_n] to indicate that the tail of P contains 


all measurable subsets generated by the remote past of P. As such, tails are 
T-invariant. 


From the preceding discussion concerning Bernoulli shifts, there clearly 
appears a relation between the triviality of the tails of partitions and the 
dynamical system mixing properties. 


Proposition 2.3.5. A dynamical system (X,T,p) is K-mizxing (see Re- 
mark 2.8.2.2) if and only if all its finite partitions have trivial tails. 


Proof: Consider a finite partition P and its tail. By definition, Tail (P) is 
mapped into itself by T; thus, if in (2.64) So € Tail (P), then So belongs to 
P n = Vion T-I (P) for all n and thus to the o-algebra ©'°°(P), generated 
by P_n}. Therefore, one can choose B = So in (2.64) which then yields 
u(So N So) = u( So)? and, in turn, Tail (P) = N. 

Vice versa, let us choose as S, in (2.64) a finite partition P 9 and con- 
sider the o-subalgebras P_n) C X generated by the infinite refinements 
Vion I *(P). The corresponding conditional probabilities (S|P_n})(x), 
S € X (see (2.49) and (2.50)) are such that, for any Ag € X and B € P_yy, 


(Ao O B)(2) = w(Ao)(B)| < f dn (2) [uC Aol Py) Ce) ~ no) 


Because of Theorem 2.2.1 and Examples 2.2.4.1,3, from P_n] | M it follows 
that u(Ao|P-n]) (£) > (Ao) u-a.e. when n — 00, whence K-mixing follows 
from Lebesgue dominated convergence theorem. 


In the next chapter, by using entropic tools, we shall show that all finite 
partitions of K-systems have trivial tails and are thus K-mixing; at this point 
it suffices to observe that 


Starting from the finite set S, of measurable subsets {S;};—1, one constructs 
the partition of ¥ consisting of Sg := Ni Si, Sj := S;\S6 and Shy = X\Ui_, Si. 
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Proposition 2.3.6. If a dynamical triplet (X,T, u) has a generating parti- 
tion P with trivial tail, Tail (P) = N, then it is a K-system. 


Proof: The o-algebra Po := Vp>o T” (P) is a K-partition. 


Examples 2.3.3. 


1. As for Bernoulli shifts, also for the Markov shifts in Example 2.1.5, 
the partition P = {Po} consisting of simple cylinders as in (2.25) 
satisfies condition i) and ii) in (2.72). However, the argument used 
when discussing the triviality of the tail for Bernoulli shifts shows that 
Tail(P) = N if and only if the Markov shifts are mixing in which case 
by Proposition 2.3.6 they are K-systems. 

2. Consider Example 2.1.2 with the frequencies w;,2 such that the system 
is ergodic (see Remarks 2.1.2.2 and 2.1.2.3). As a partition of T?, choose 
the Cartesian product C of the partitions of the 1-dimensional torus T 
into atoms C1 = {0 <0 < m}, C2 = {r < 0 < 2m}. Because of ergodicity, 
the trajectories of the end points of the atoms C; x C; fill T? densely and 
the intersections of their images T*(C; x C;) under the dynamics become 
finer and finer and approximate better and better the Borel c-algebras 
of T?. Actually, this already occurs if one restricts to T~/(C) with j > 0, 
namely a T~4(C) = X. This also means that Tail (C) = X. 

3. The partition in Example 2.2.2 of the two-torus into the vertical half- 
rectangles gives thinner and thinner vertical rectangles while moving into 
the past, and thinner and thinner horizontal rectangles into the future. 
Their intersections are squares of increasingly small side, by means of 
which one can approximate better and better every Borel subset of T?. 
The tail of such a partition is trivial due to the fact that the Baker map 
acts as a Bernoulli shift with respect to it. 


In order to set the ground for a quantum extension of the notion of 
K-system (see Section 7.1.4), we operate a reformulation of the conditions 
in Definition (2.3.4) in terms of algebras of functions. Given a K-sequence 
{2 := T* (Xo) jez of o-subalgebras, consider the Abelian von Neumann sub- 
algebras M; := LX (X, X) = OF[Mo] consisting of the essentially bounded 
3}-measurable functions on Æ¥ (see Section 2.2.2). Then, one has 


Mec Mmi, VMi=M, NM = {1}, 


tEZ 


where the generation of M by V is by strong-operator closure on the Hilbert 
space L? (¥), while A denotes set-theoretic intersection. 

We shall see in Section 5.3.2 that unital Abelian von Neumann algebras 
M can always be identified with suitable L? (X) and represented as multipli- 
cation operators on the Hilbert space L? (¥ ). It thus makes sense to provide 
an algebraic reformulation of Definition 2.3.4 (see Definition 2.2.4). 
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Definition 2.3.6 (Classical Algebraic K-Systems). 
A classical von Neumann algebraic triplet (M,Or,w) is an algebraic K- 
system, if there exists a von Neumann subalgebra No C M such that, setting 


Ni := O4(No), te Z, 


1. Ni C Nea1 for all t € Z; 
2. Viez M =M; 
8. Maez Ni = {Al}. 


Any such sequence {Ni }tez of von Neumann subalgebras of M will be called 
a classical K -sequence. 


Remark 2.3.5. The above definition can also be formulated in an Abelian 
C* algebraic context; there, M will be a C* algebra as well as the subalgebras 
of the K-sequence and V,ez ^M will denote the algebra generated by norm 
closure. The classical spin chains discussed in Section 2.2.3 are instances of 
classical C* K-systems: with M = Dz, No will be the left half-spin chain Do 
generated by the diagonal matrix algebras Di,,q) with p < q < 0. Then, the 
algebraic K-sequence will consist of the subalgebras Dy = Oo (Do) generated 
by the diagonal matrix algebras Di, 4) with p < q < t. 


Given a measure-theoretic K-system with a K-sequence of o-algebras 
{Xn}nez, instead of considering the von Neumann algebras Mn, one may 
focus upon the Hilbert spaces H, := L? (av , Xn) of square-summable X,- 
measurable functions on æ. From the conditions i), ii) and iii) in Defini- 
tion 2.3.4, it follows that 


l; H: C Hipi for allt € Z; 
2. Urez He = H; 
3. Miecz He = C1, 


where H := L? (X) and C 1 stands for the Hilbert space consisting of constant 
functions on ¥ (u-a.e.). Since, according to the construction of the unitary 
Koopman operator in (2.4), 17(5,)(x) = 15,(L~!x) = (UF 1s,) (£), it follows 
that H; = U7 "Hb; whence, setting Ki := Hy+1 © Hn, 


t#s—=>K,LH,, H=Q@K,. (2.74) 
teZ 


By choosing an orthonormal basis {| fj )}je7 in Ko, one gets an orthonormal 
basis for H; of the form {|e;4) := Uz"| f;)};e7 and thus one for H of the 
form {| e; )}jeJ,tez. Any unitary operator U on a separable Hilbert space H 
which generates an orthonormal basis of the previous form is said to have a 
Lebesgue spectrum of multiplicity J. 


Proposition 2.3.7. /91] For a K-system (¥,T, u), J is countably infinite. 
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Proof: Since Hp C Hy; there surely exists f € Hı with g := f — E(f|X5), 
where E(f|Xo) is the conditional expectation of f with respect to Xo, such 
that E(\g|?|20) #0 on a Xo-measurable subset So with u(So) > 0. Consider 


the function G € Hı defined by G(x) := poe O Oe 1s (x); from the 
E(|g|?|Xo) (x) 
properties of the conditional expectation it follows that 


H(g1¥)(2) a 
E(G|X)(x) = eine Ms so(z)=0 (x) 
BiGo) = FUE EN 1 la) = tala) Co). 


Let {| e? )}xen be an orthonormal basis in the Hilbert space L? (So) of square 
summable functions supported within Sp and set | f? } := Mal. e? ) where Mg 
denotes the multiplication by G, namely f(x) = G(x) e?(x). Notice that 
| fP) € Hı; further, by using (2.51) and (2.53), it follows ‘int | fR) L Ho, 
whence | f?) € Ko = Hı © Ho as defined in (2.74). Indeed, let | ho) € Ho, 
then (*) yields 


(hol fe) -f dy (x) ho(x) G(x) ef (2) = A dy (x) E(hg G e}|20)(2) 


du (x) ho (x) E(G|Xo) (x) eg(x) = 0. 


So 


Also, the set {| f?)}nen is an orthonormal basis for 
(FIFe) = f an du (2) (€2)* (2) |G(a)[? e2(2) 
= f ana) B (eS) GP elz) ) 
So 
du (2) (€2)* (2) E(IGPISo) (E) el) = (e leg) = dix « 


Therefore, Kı must be an infinite dimensional separable Hilbert space. 


2.3.2 Ergodicity and Convexity 


We conclude this section by considering some aspects of ergodicity and mixing 
in relation to continuous dynamics on compact, metric spaces and to the 
convex space M(X,T) of their regular, T-invariant Borel measures. The first 
result [313] states that ergodic measures are extremal in M(X,T), namely 
they cannot be decomposed into convex combinations of other measures in 
M(#,T). We shall make use of the algebraic setting of Definition 2.2.4. 
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Proposition 2.3.8. An algebraic triplet (C(¥X),Or,w,) is ergodic if and 
only if M(*«,T) D Wy, = Aw, + (1 - A)we, 0<A< 1, w12 € M(%*,T) 
implies wp = W4,2. 


Proof: Suppose a T-invariant Borel measurable subset FE exists such that 
0 < u(E) < 1 and let E° := X\E. With Ig and Ig their characteristic 
functions and w,,(1z) = (E), the two states 


O(x) fof) HAE | C(x) fio wl) = 
H(E) 
are different and both in M(&,T); furthermore, they decompose w,,. for 
wu = P(E) wy + (1— p(B) wa. 

Suppose w, can be decomposed as stated in the proposition, then the 
measure pı E€ M(X,T) corresponding to w; is absolutely continuous with 
respect to u (see Remark 2.2.4.2). Let fı(x) > 0 be its Radon-Nikodym 
derivative. Consider the measurable subset E = {x € Æ : f(x) < 1}. Observe 
that one can decompose E = (E N T~!(E)) U (E\T71(E)) by means of 
disjoint subsets and, analogously, T~!(E) = (THE) N E) U (T7\(E)\ B). 
As oT" = p, 


w(i) = f ra VO AG) + Í a PO AO) = Cra) 


= aule) Ale) + f d(x) fila) . 
T-1(B)NE T-1(E)\ E 


Therefore, as fı < 1 on E while fı > 1 outside it, it follows that 


w(E\T-1(B)) > f 


E\ T-1(E) 
> w(T~*(E)\ E) . 


da(z) faa) = I dala) file) 


T-1(E)\ E 


Then, (E\T-!(B)) = u(T-!(E)\ E) = 0 since 


Thus, E is T-invariant apart from sets of 0 measure p; if the system is ergodic, 
this implies either u(E) = 0 or u(E) = 1. The latter equality cannot hold, 
otherwise 1 = wı(1) = wı(lg) < u(E) = 1; thus, (E) = 0. The same 
argument applied to F := {x € ¥: fı(x) > 1}, leads to u(F) = 0 whence to 
fi(x) = 1 p-a.e. on X which implies w, = w1 and thus extremality. 


The second result is a refinement [313] of Proposition 2.3.2. 
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Proposition 2.3.9. The triplet (C(X),Or,w,) is ergodic, respectively miz- 
ing if and only if, for all f € C(X) and g € L} (4), 


t—1 

lim + olf 0 OF 9) = wy fey (a) (2.75) 
s=0 

Jim wy(fo Org) = wy(f)onlg) - (2.76) 


Proof: If (2.75) holds, it implies (2.65) by the fact that any Y € L2(4) is 
also summable and can be approximated in L? (X ) by continuous functions. 
Vice versa (2.65) implies (2.75) as any f € C(¥) also belongs to L? (&¥) and 
any f € L} (X) can be approximated in L} (X) by square-summable func- 
tions. The same considerations can be used to prove that (2.76) is equivalent 
to (2.66). 


Example 2.3.4. Given the algebraic triplet (C(4’),Or,w,), let v be an- 
other state on C(4) absolutely continuous with respect to w,,, but not Or- 
invariant, that is, for all f € C(%¥), 


Uf) = | du (x) g(x) f(a), vO) = / du(2) g(x) = wulg) =1, 
x x 


with Radon-Nikodym derivative g, # gy 0 Or € L}(¥). From a physical 
point of view, w, can be considered as a perturbation of the equilibrium 
state w. By duality (see (2.8)), for all f € C(¥), (f) = v(f o OF) where 
v, := v o Oz". If (C(x), Or,w,) is mixing, then (2.76) implies 


jim (f) = wul f) wulg) = wul f), VE C(#). 


Physical instances of measures that are absolutely continuous with respect 
to an invariant one are local perturbations of equilibrium states; then, being 
mixing guarantees that these perturbations fade away in time and provides 
a mathematical explanation of relaxation to equilibrium. 


2.4 Information and Entropy 


At its simplest, information theory is concerned with the description of two 
parties transmitting information to each other. Information is physical as it 
is encoded into physical carriers, e.g. electromagnetic waves, that undergo 
physical processes, e.g. interactions with an optical fiber. As long as the 
laws that describe these processes are those of classical physics, one talks of 
classical information theory. 
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Fig. 2.2. Classical Transmission Channel 


2.4.1 Transmission Channels 


In the following, we shall consider two parties A and B exchanging signals 
according to the following typical scheme (see Figure 2.2): 


1. At each use, a classical source emits symbols i from an alphabet consist- 
ing, say, of integers I4 = {1,2,...,a}: symbols are emitted with proba- 
bilities 74 = {pa(i)}%,. 

2. After £ successive uses of the source, the source outputs are strings of 
length £, i := iyig---ig € Q, emitted with probabilities p yw (i). 
These strings can be interpreted as outcomes of a random variable 
AM = eae A; which is the join of successive random variables from 
the stochastic process {A;}ien (Ai := A) associated with countably suc- 
cessive uses of the source. The random variable AM is distributed accord- 
ing to the probabilities myc) = 4 Dace (4) Wee that £ subsequent 

a a 
uses have actually emitted a given string of symbols. 

3. The sender encodes the emitted strings i into strings of fixed length 
n, £”) = tito- £n E a, consisting of symbols x; from another 
alphabet Ix := {1,2,...d}. The encoding procedure amounts to a map 


Em) ; RP = a, 
EM » 2 54 4 EMG) = a™ € 2™ | (2.77) 


4. The code-words 2”) are then sent to a receiver via a transmission chan- 
nel, C™) =C x C x---C, which transforms an input string x™ into an 
output string y™ = C(™ (al) = yiy2* -Yn E R, consisting of sym- 
bols y; from, possibly, another alphabet Ty := {1,2,...,«}, according to 
a set of transition probabilities (compare Example 2.1.5) 


rey |a™) > 0, XO pole) =1. (2.78) 
yen” 
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These latter quantities take into account the possibility that the trans- 
mission channel be noisy and thus might randomly associate different 
outputs y™ to a same input a”). 


. The channel inputs and outputs are thus random variables X‘) and 


Y) with outcomes « and y™. If the code-words «) occur with in- 

put probabilities Tyn) = {pxim (EM) | wenei the transition probabil- 
d 

ities provide joint probability distributions for the joint random variables 


X™ v Y™ given by 
Txmyy(r) = {pxcvve (a, y) : (a, y™) € oF x at 
Pximyy tm (2, y™) = pgm (2) p(y™ |æ) . (2.79) 


Consequently the output random variables Y) are distributed according 
to the marginal probability distributions Tyn) = {Pro (YF men 
where 
Pym (y™) = ys Pxmyym (2™, y) i (2.80) 
area” 


. At the receiving end of the transmission channel, the output string y”) 


goes through a decoding procedure whose aim is to retrieve the actual 
source output i that has been encoded into #2” = € (i) from the 
received string y = C™ (æ). Decoding amounts to a map 


2M) > y™ o Dy) =i ERO. (2.81) 


The whole procedure comprises the following steps 
oO gy & ae BS 3 


— => 


£ n n £ 
— 2 en e 2” ERP 


The efficiency of the transmission is related to how much the decoded word 


a differs from the word 4 that, after being encoded into a”), has been sent 
through the noisy channel and received as y. The task is thus to minimize 
decoding errors while keeping a non-vanishing number of bits transmitted 
per use of the channel. 


In Section 3.2.2, we shall consider the class of memoryless channels with- 


out feedback; they act on input symbols in a way which is statistically in- 
dependent from previous inputs and outputs. As such, they are completely 
specified by factorized transition probabilities: 


p(y |x) -JI plyilzi) - (2.82) 
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Examples 2.4.1 (Channels). 


1. Noiseless Binary Channel: two classical bits (bits) 0,1 are emitted 
with probabilities p4(0), pa(1) and sent through a noiseless channel: 


p(0|0) = p(1|1)=1, p(0|1) = p10) = 0. 


2. Binary Symmetric Channel: two bits are emitted with probabilities 
pa(0), pa(1) and sent through a channel which flips them according to 


p(0|0) = p|1)=1-p>0, p(1|0)=p(1|1)=p>0. 


3. Binary Erasure Channel: two bits are emitted with probabilities 
pa(O), pa(1) and sent through a channel which does not flip them, but 
may erase anyone of them with a same probability 0 < a < 1. This ac- 
tion is described by a map C from the two-letter alphabet {0,1} onto the 
three-symbol alphabet {0,1,2}, where 2 stays for a junk symbol, and by 
transition probabilities 


p(0|0) = p(1|1)=1-a, p(2|0) = p2ll)=a. 


A noiseless channel is characterized by transition probabilities that equal 
1 in correspondence to specific pairs of input and output strings otherwise 
they vanish. There are then no distortions in transmitting or storing infor- 
mation by means of these channels. In such cases, the question is whether 
the source information can be compressed and retrieved with negligible prob- 
ability of error, the possibility of compression depending upon the presence 
of redundancies and regularities in the source. 

More precisely, if the source emits binary strings of length n, one asks 1) 
whether for each bit from the source one can store h < 1 bits , still being able 
to reliably reconstruct the information emitted by the source from the 2”*" 
bit strings effectively retained and 2) which is the optimal compression rate 
h achievable. This problem is addressed by Shannon’s first theorem which 
asserts that, for stationary sources, the optimal rate is the their entropy-rate. 

In the presence of noise in the transmission channel, the strategy is some- 
how the reverse with respect to noiseless transmission; in order to reduce the 
possibility of noise-induced errors, one introduces redundancies by multiple 
uses of the channel. The aim is to optimize the number of signals that can 
faithfully be transmitted by n uses of the channel. Shannon’s second theo- 
rem proves that this number can be made increase exponentially with n at 
an optimal rate R, the channel capacity. 


2.4.2 Stationary Information Sources 


In most informational contexts, stationary sources are a reasonable descrip- 
tion of the actual physical processes taking place. Stationarity means that the 
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probability of a string i does not depend on when the source had emitted 
it, but only on the letters emitted. This is equivalent to (compare (2.30)) 


S paw (i102 -+- te) = paw (izi3 +--+ ie). 
ay 


This condition goes together with the fact that the probability of a string of 
length €—1 must be the sum of the probabilities of all words of length £ with 
the same first £— 1 symbols (compare (2.29)), 


S paw (iriz +++ ie) = pau-n (iriz: + ig—1) « 
ig 
The similarities with shift dynamical systems now are apparent. 


Lemma 2.4.1. A classical, stationary information source corresponds to a 
stationary stochastic process {A;}inen, where the random variables A; take 
on values in an alphabet I4 = {1,2,...,a} and the joint random variables 
AM := Vi_, Ai are distributed with probability distributions main) satisfying 
appropriate compatibility and stationarity conditions. 

Equivalently, a stationary classical source can be described by the measure- 
theoretic triplet (Qa, To, pa), where pA = HA o I is a state on the set Ra 
of semi-infinite strings i = {i;}jen, ij € Ia, equipped with the left shift To. 
The restrictions u of wa to the sets of finite strings N are given by the 
probability distributions T a(n). 

Finally, a stationary source can be described as a C* triplet (D4, Oo, VA) 
as in Definition 2.2.5, namely by a semi-infinite classical spin-chain con- 
sisting of a lattice of a-valued spins described, locally, by tensor products 


n—1 
D™ = Q D,(C) of diagonala x a matriz algebras Da(C), equipped with an 
j=0 


automorphism Os which amounts to the left shift along the chain and with a 
O -invariant state V4 such that (compare (2.56)) 


nps yY motra n 
iM EQh™ 


Examples 2.4.2. 
1. Bernoulli Sources (see Example 2.1.4): the probabilities of strings are 
products of the probabilities of their symbols, Pam (i™) = [[ x4), 
j=l 


that are statistically independent from each other. 


2.4 Information and Entropy 61 


2. Markov Sources (see Example 2.1.5): the probability of emission of the 
n-th symbol depends only on the n — 1-th one, namely 


pam (t™) = plin|ir, i2,- in-1)Paa-n (inte «++ in_1) 
= pltn|in—1) Datn—v (419+ Ini) 
= P(in|¢n—1) P(én—1lin—2) +++ pa(izli1)pa (i1) , 


where p(inli1, i2,***in—1) are the conditional probabilities for the occur- 
rence of the in-th symbol if the symbols 71,72,...,%,-1 have already 
occurred. Using (2.30), it follows that stationarity is equivalent to the 
probability vector | 74) = {pa(t)}ier, being eigenvector, relative to the 
eigenvalue 1, of the matrix of transition probabilities. 


2.4.3 Shannon Entropy 


Like an information source A that emits symbols j € {1,2,...,a} with proba- 
bilities p4(j), also a partition P = {P;}?_, of the phase-space of a dynamical 
system (¥,T, p) into atoms with volumes p(P;), can be interpreted as a clas- 
sical random variable. In the latter case, randomness is related to the fact 
that the phase-point or state of the system is localized within the atom P; 
with probability u(P;). 

The notion of entropy measures the amount of uncertainty about the out- 
comes of a random variable like P before the phase-point has been localized 
within a definite atom, for instance as a consequence of an observation or 
a measurement process of sort. Equivalently, entropy measures the amount 
of information, relative to the partition P, that has been gained after the 
phase-point of the system has indeed been localized in one of its atoms. 


Definition 2.4.1 (Shannon Entropy). The Shannon entropy of a discrete 
random variable A with probability distribution Ta = {pa(j)}Fa1 is given by 


a 


=~ Dal )logpa(j) = So n(pals)) 5 (2.83) 


j=l 
where 
0 z=0 
n(x) = aie 0<a<l (2.84) 


Remark 2.4.1. The Shannon entropy plays for discrete dynamical systems 
the role played by Gibbs entropy for continuous systems which is defined 
s [167, 300] 


= J dz p(x) log p(x) , 
x 


for a state on the phase-space ¥ with probability density p(x). 
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The Shannon entropy is such that H(A) = 0 if and only if one outcome, 
say j*, occurs with probability p4(j*) = 1, while pa(j) = 0 for j 4 j*; it 
reaches its maximum H(A) = loga, when all outcomes are equiprobable, 
pa(j) = 1/a. Indeed, the function n(x) is concave, whence 


r(logx — logy) > x-y, Va,ye€ [0,1], (2.85) 


with equality holding if and only if x = y. 
Let then E be a random variable with tz = {pp(j) = 1/a}$_,, then 
H(E) = loga and 


H(A) - =~ Lmtd )(logpa(j) + loga)) < X (pal) - 1/2) = 0. 


j=1 


Given two random variables A and B, we shall keep the notation used for the 
join of two partitions and denote by A V B the random variable with joint 
probability distribution 


TAVB >= {pavB(t,j) }iel4,j€Ip , Ia = {1, 2, Srey a} ’ Ig = {1, 2, sing b} . 
(2.86) 
By summing over the outcomes of A, respectively B, one obtains the marginal 
probability distributions m4 := {pa(t)}ier, and Tg := {pa(j)}jer,, where 


b a 
z > paveli j) , pal) = > Paveli j) ; (2.87) 


Lemma 2.4.2 (Subadditivity). Given two random variables A and B, 
H(AV B) < H(A) + H(B). (2.88) 


Proof: Given Tavpg as in (2.86) and 7,4 and 7g as in (2.87), use (2.85) 
with x = DAvB(t, j) and y= pa(t)pa(J), 


H(A) + H(B) - H(A V B) = man noate baste Paved ; 


> S (pavB(t,j) — pa(i)pB(j)) =0. 


icla,jEIB 
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Remarks 2.4.2. 


1. As already observed, any finite (measurable) partition P = {Pi}1, of 
(X,T, p) is a random variable P whose outcomes correspond to the labels 
of the atoms to which the system phase-point happens to belong. The 
volumes of the atoms give the probabilities of such occurrences, so that 
a finite partition P also attributes to the random variable P the natural 
probability distribution mp = wp = {u(Piierp- 

2. In analogy with Definition 2.2.1.2, a random variable A is finer than 
a random variable B (B = A) if each outcome j € Ig of B is deter- 
mined by a subset I, C I4 of outcomes of A. It follows that, if A has 
probability distribution 74 = {p4(i)}ier, and B probability distribution 
TB = {paQ@)}jets, BX A implies pa(j) = Vier Pali). 

3. According to Definition 2.2.1.3, the refinement P V Q of two partitions 
P = {P;}icrp and Q = {Q;}jerg, is a random variable P V Q with joint 
probability distribution upyg = {u(PiN Qj) }ierpjerg: P V Q is finer 
than both random variables P and Q; also, Q x P => PVQ=P. 


2.4.4 Conditional Entropy 


Because of possible statistical correlations, the knowledge of a random vari- 
able A may decrease the uncertainty about another random variable B; the 
less so, the more A and B are statistically independent. These intuitive ar- 
guments are formalized by introducing the notions of conditional probability, 
conditional entropy and mutual information. 

Consider two random variables A and B with probability distributions 
TA = {pa(t)}ier,, respectively mg = {pp(j)}jer,, and joint probability 
TAvB = {pAvB(i, J) }iera,jerg- The quantity 


ie pavB(i, j) 
palili) = = (2.89) 
1 PBJ) 
represents the probability of the outcome A = i conditioned upon the out- 
come B = j. Altogether, t4\p=; = {paj (ilj) }{— is the conditional probabil- 
ity distribution of A conditioned upon the outcome B = j. The conditional 
probabilities are such that 


Pap) S05 Papa Gl)=1. Yj=1,2,...b. 
i=l 


The notion of conditional probability is naturally associated to that of 
conditional entropy which measures the amount of uncertainty about a ran- 
dom variable A which is left once that relative to another one, B, has been 
removed. 
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Definition 2.4.2 (Conditional Entropy). 

Given two random variables A and B with probabilities 74, Tg as in (2.87) 
and joint probability navn as in (2.86), the conditional entropy of A with re- 
spect to B is 


H(A|B) = S00) H(A|B = j) (2.90) 


Av B(i, og PABU J 
== Spot X PREGA iog Ateh I 
i=l 


PBJ) 
= a ee — H(B), (2.91) 


where H(A|B = j) is the Shannon entropy corresponding to the conditional 
probability 7 4\B=;- 


Lemma 2.4.3. The conditional entropy fulfils 


0 < H(A|B) < H(A) 
H(AV BIC) = H(A|C) + H(BJAV C) < H(A|C) + H(BIC) . 


Proof: The lower bound follows since the left hand side of (2.90) is positive, 
while the first upper bound is a consequence of (2.88) applied to (2.91). 
Further, using the latter relation one gets 


H(AV BIC) = H(A V BVC)- H(C) 
= H(A V ©) - H(C)+ H(A V Bv C) - H(A v ©) 
= H(A|C) + H(BIAVC), 


while (2.88) applied to H(A V B|C = k) gives the second upper bound. 
Corollary 2.4.1. B < A= > H(B) < H(A). 


Proof: From Remark 2.4.2.3 it follows that B < A => AV B = A; thus, 
H(A) = H(AV B) = H(A|B) + H(B) > A(B). 


Example 2.4.3. If N denotes the (trivial) random variable with only one 
certain outcome, then H(A|N) = H(A) for any other random variable A. 
By the definition of conditional entropy, H(A|B) = 0 implies that in (2.90) 


1 pava(i, j) ig pavB(t, j) 


pa) 7) T e 
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Therefore, for fixed j, pava(i,j) = pa(J) for only one i € T4; thus, for each 
fixed i € I4, the index set Ig can be subdivided into disjoint subsets Jj, such 


that 
pali) = >> pavsli j) = X pals) - 
jel jel 
That is (see Remark 2.4.2.2) A < B; indeed, the outcomes of A are deter- 


mined by those of B. In other words, when knowing B means knowing A, 
then A < B. Viceversa, if B is finer than A, then H(A|B) = 0; in fact, 


AV B = B => H(A|B) = H(AV B)— H(B)=0. 


Remarks 2.4.3. 


1. Conditioning can be extended to random variables A;, i = 1,2,...,n. 
The probability of the events A; = a;, i = p+1,...,n conditioned on the 
events A; = aj;,i=1,...,p is given by 


P(r +++ ApAp41 ++ an) 
a -Anlaa = 
ll a 


where explicit reference to the random variables in p(---) has been omit- 
ted, for sake of simplicity. It follows that also the notion of conditional 
entropy can be extended to 


H(APtTy AP? ...y Am|Aly A? V AP) = — X p(ar,aa,..-,ap) 


A1,.+-Ap 


x 5 plap+1 `` anlar- ap) log plap+1 ` anlar: ap) « 


QAp+1;--; an 


2. A sequence of random variables {A’}jen form a Markov process as in 
Example 2.4.2.2 if p(a,|a1-++@n—1) = p(G@n|Qn—1) for all n € N. In such 
a case H(A"|Alv---A"~1) = H(A™|A"~4). 

3. Since the conditional entropy is positive, it follows that 


H(AV B) > max{ H(A), H(B)} ; 


Both this observation and subadditivity (2.88) agree with the interpre- 
tation of the entropy as a measure of uncertainty. The latter is in fact 
greater about AV B than about either A or B, while, due to possible sta- 
tistical correlations between A and B, the uncertainty of AV B is smaller 
than the sum of the uncertainties of A and B independently. Further, 
due to (2.85), 

H(AV B) = H(A) + H(B) 


if and only if 7 av g factorizes into the product of the probabilities, namely 
if and only if A and B are statistically independent. 
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4. The second upper bound in Lemma 2.4.3 yields H(B|AV C) < H(B\C); 
as C < AVC, this inequality is a particular instance of the more general 
monotonicity property of the conditional entropy established in Corol- 
lary 2.4.2. 


Example 2.4.4. Suppose a random variable is given by a finite partition 
P = {P;}4, with atoms that are measurable with respect to a o-algebra 
X generated by a measure-algebra Xo as in Example 2.2.1. Then, for any 
e > 0, there exists a partition Q = {Q;}4_, with atoms Q; € Xo such that 

H(P|Q) < €. In fact, as showed in the example, one can always construct Q 
such that, for alli = 1,2,...,d, one has 


u(P;) 
< 
u(P; AQ;) oan, z >? 0<8<1. 


P;) 


WP) < (Qi) +E and uP: Agi) > MQ) PNQ) - 


Thus, (Qi) > ao and 6 u(Q:) > u(Q:) — u(P; N Qi), whence 
RAQ: 
prions) = PERH 5 1-5. 


Since Tpjo=; is a conditional probability, it follows that ppjo=:(jli) < 6 for 
j Æ i. Finally, choosing 6 so that the continuous function ds ) in (2.84) be 
such that n(x) < €/d when 0 < x < ð and 1 — ô < x < 1, (2.91) yields 


d 
H(P|Q) = Sg i) > nPpo=lli)) Se. 
i=1 j=1 


Proposition 2.4.1 (Strong Subadditivity). 
Given three discrete random variables A, B and C, the following inequality 
holds, 
H(AV BVC)+H(B)< H(AV B)+H(BVC), (2.92) 


together with those obtained by cyclic permutations of A, B and C. 


Proof: Similarly to the proof of Lemma 2.88, the result follows by apply- 
ing (2.85) as follows: 
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H(AV BV C)+ H(B) —- H(AV B)- H(BVC)= 


ao > er log Pavavel j.k)pa(9) 
pavs(t,j)pBvc(J, k) 


t€l4,je€lp,kelo 


its 1,9 j,k; 
Ss 7 » (pavevcli j,k) pavel Dorset H = 0 . 
i€Ia,jEIgB,kEIc PB\J 


As a consequence of strong subadditivity, the conditional entropy mono- 
tonically decreases upon refinement of its second argument. 


Corollary 2.4.2. B < C => H(A|C) < H(A|B). 


Proof: From (2.92) and (2.91), 


H(AV BV C) — H(BV C) = H(A|BVC) < H(A V B) — H(B) = H(A|B) . 


The result follows since B < C => BVC =C. 


2.4.5 Mutual Information 


A notion related to the conditional entropy is that of mutual information: it 
measures the amount of information about a random observable A that can 
be achieved by knowing another random variable B. 


Definition 2.4.3 (Mutual Information). 
Given two random variables A and B, their mutual information is given 


by 


I(A; B) := H(A) + H(B) — H(A V B) 
= H(A) — H(A|B) = H(B) — H(B|A) . (2.93) 


The mutual information amounts to the relative entropy (also known as 
Kullback-Leibler distance or information divergence) of the joint probabil- 
ity distribution mavs with respect to the product probability distribution 
Tavs = {pa(i)pa(s) biers jer, Obtained from the marginal ones (see (2.87)): 


pavB(t, j) 


OEE | a (2.94) 


S(žavs, Tava] = X pava(i, j) log 
tj 


Since H(A) measures the unconditioned uncertainty about A and H(A|B) 
the uncertainty about A if one knows B, their difference amounts to the 
knowledge of A given by B. If A and B are statistically independent, know- 
ing B does not give any information about A, whence H(A|B) = H(A), 
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and I(A; B) = 0. On the other hand, if A is finer than B, then knowing A 
means knowing B, thus B x A => H(B|A) = 0 and I(A; B) = H(B). Vice 
versa, I(A; B) < H(B) means that H(B|A) > 0 or, in other words, that the 
knowledge of B is unable to remove all the uncertainty about A. 

An interesting inequality involves the mutual information in connection 
with three random variables A, B and C that form a so-called Markov chain 
A— B — C [92]; namely (see Remark 2.4.3.2) 


= pavBvc(t, j, k) = pegevc(j, k) 
pavB(i, j) pB(j) 


Notice that C, B and A form a Markovian chain C — B — A, too; indeed, 
as pova(k, j) = pavo(j,k) it turns pout that 


Po|AvB=(i,j) (Alt, J) : = po\p=; (kl) 


5 F pavByc(i, j, k) : pavB(t, j) 
= (2 k, Se Sh =j kli Sa aria ake 
PalcvB=(k,7) (4 j) pova(k, j) PC|B al Delt 
pavB(t, j) Si 
pat) alB=;(ilj) 


Using the latter property one can prove the so-called data processing inequal- 
ity [92]. 


Proposition 2.4.2. A > B —> C => I(A; C) < I(A; B). 


Proof: From Definition 2.4.3, I(A; B) — I(A;C) = H(A|C) — H(A|B) 
while the Markovianity assumption yields H(A|B) = H(A|B v C) (see Re- 
mark 2.4.3.2), whence, from (2.4.1), 


I(A; B) — I(A;C) = H(A|C) — H(A|B V ©) 
= H(A V ©) + H(B V C) - H(A v BV C) - H(C) 20. 


The meaning of the data processing inequality is that the mutual infor- 
mation of two random variables A and B cannot be increased by any further 
processing of B by a function C = g(B), for this yields a Markov chain 
A>-B-C. 


Example 2.4.5. When dealing with noisy transmission channels, signals a 
from a source described by a random variable A are encoded into code-words 
b(a) that give rise to another random variable B = B(A). Then, the code- 
words are sent through the channel which outputs signals c = c(b), providing 
a third random variable C(B). Altogether, A, B(A) and C(B) form a Marko- 
vian chain A — B(A) — C(B), as well as C(B) — B(A) — A; thus, we get 
the data-processing inequalities 


I(A;C(B)) < (A; B(A)), I(4;C(B)) < I(B(A);C(B)). (2.95) 
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For a mathematical approach to classical dynamical systems and ergodic 
theory see [17, 61, 91, 199, 313]. For a more physical point of view on ergodic 
theory and related questions, one may consult [7, 106, 167, 300]. [16, 299] have 
been used as references for Hamiltonian mechanics and integrable systems, 
as well as [106, 129, 228, 271] for classical chaos. The review [62] discusses in 
detail the signatures of chaos in discrete classical systems. 

For a modern overview of probability theory see [176]. 

For the notions of entropy and conditional entropy in a dynamical system 
context see [17, 61, 91, 164]; consult [92] for the same notions and that of 
mutual information from the point of view of information theory. As regards 
the relations and epistemological links between the entropy of Shannon and 
those of Gibbs and Boltzmann in a thermodynamical setting, see [167, 300, 
314]. 


3 Dynamical Entropy and Information 


Repeated uses of an information source or successive localizations of a tra- 
jectory with respect to a partition of phase-space, give rise to stochastic pro- 
cesses. Since equilibrium states u give rise to shift-invariant probability distri- 
butions, the Shannon entropy is a constant of the motion: for instance, given 
the time-evolved partition T~4(P) in (2.38), one has H,,(T~4(P)) = H,,(P). 
Therefore, it is not the Shannon entropy, rather the entropy rate that is useful 
to quantify the degree of irregularity of the dynamics. Since its introduction 
as a mathematical tool, the notion of entropy rate or, more generally, of 
dynamical entropy, has been playing a major role in the theory of classical 
dynamical systems for it provides links among as different properties as dy- 
namical instability, informational compressibility and algorithmic complexity. 


3.1 Dynamical Entropy 


As in Section 2.4.3, given a dynamical system corresponding to a measure- 
theoretic triplet (¥,T, u), we will consider a coarse-graining of X by means 
of a finite, measurable partition P = {P;}?_, and identify P with the random 
variable (denoted by the same symbol) corresponding to the process of lo- 
calization of the system phase-point (state) within one of the disjoint atoms 
P; that cover VY. The outcomes of P are the labels of the atoms and occur 
according to the discrete probability distribution up = {p (i) := u(Pi)}P_). 
Further, the time-evolved partition Pi := T~/(P) at time j in (2.38) is iden- 
tified with the j-th random variable of a stochastic process {P!}jez. Thus, 
the refined partitions P(™ with atoms pe as in (2.40) correspond to joint 


n) 


random variables with discrete probability distributions as in (2.41), 


Hp = {ot ca! ?) A HER eam ? 


and Shannon entropies (comparing with (2.83), we explicitly indicate the 
dependence of the entropy from the measure and the partition) 


Hy(P™) = — Sop 4) log pa) . (3.1) 
Me Qh” 
F. Benatti, Dynamics, Information and Complexity in Quantum Systems, 71 
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Definition 3.1.1 (Entropy Rate). The entropy rate of (¥,T, u) with re- 
spect to a finite, measurable partition P is given by 


he (T, P) := lim 1 a (P™) = inf L a (P™) ; (3.2) 
n—oo n n n 


The above limit exists because of the stationarity of u, 
H, (P) = H,(P) Wk>0, (3.3) 


and because of the subadditivity of the Shannon entropy [313]. Together, 
they yield, for all 0 < p< n- 1, 


H,(P™) < H,(P™) + H, (V) 
k=p 


where Hp := HPN. Fix m € N and set n = km +r, 0 < r < m; then, 
from (2.88) 


Hn -Hm | H, 
n © m  km+r` 


Since m is fixed, when n goes to infinity, k goes to infinity as well, whence 


H, H, 
lim sup — A- p 
n— oo n m 


Since m is arbitrary, it follows that 


lim sup — < inf —* < lim inf = . 
n—0o n m m n—-oo n 


The entropy rate can be expressed by means of the conditional en- 
tropy (2.91) of two partitions H,,(P|Q) in such a way that A(up, Ts) measures 
to which extent the knowledge af the past of P may help to predict its future 
outcomes. Recursively using (2.91) and (3.3), one gets 


HPO) = mV pi) + H,,( 2a ei») + H,(P) 


n—2 


H,(P™) = H, (ey V Pi) + H,(P-») 


n—l1 


=e (PVP) + a P). 
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Because of Corollary 2.4.2, the positive terms in the sums are monotonically 
decreasing with increasing n, thus, arguing as in Remark 2.3.2.1, 


nkS (T, P) = tin, 2, ei») = lim m (PV) (3.4) 


V pi) . (3.5) 
j=0 


Consider the first equality in (3.5): PË is the random variable whose outcomes 
depend on which atom of P the system state is in at time i, while VS} Pi 
is the joint random variable relative to the atoms visited at previous times 
0,1,...,7-— 1. Thus, the entropy rate corresponding to P is the average in- 
formation about the next localization provided by the knowledge of all the 
previous ones. 


n-1 v= 
BS (T.P) = jin. Ha (PVP?) = im, 1 (P" 
i=l j=0 


Remarks 3.1.1. 


1. Let (2a, To, TA) describe a stationary information source. Then, the prob- 
abilities m4m, A™ = Vo Aj, together with the corresponding en- 
tropies H(A) refer to the statistical ensembles of strings of length n 
emitted by the source A. As discussed in Section 2.4.3, repeated uses 
of the source A can be described as a stochastic process {A;}j;ez where 
Aj is the random variable associated to the j-th use of the source. The 
entropy rate of the source A is thus given by 


1 
h(A) := lim —H(A™) . (3.6) 
n= n 
The entropy rate h(A) of a stationary source is the entropy per symbol 
of the stationary stochastic process {Af }jez generated by A. 
2. Because of subadditivity (2.88), the entropy rate of a partition is always 
bounded by its Shannon entropy 


hS (T, P) SEP) (3.7) 


Furthermore, since ’ inca u(P&)) = 1, from (3.1), one gets the lower 
bound 


H,(P):=— S pf) logpPGO) > log sup uP), 
MEQ acai 


whence [164] 


1 
KS : š 
hg” (T,P) > —limsup—log sup p(P). (3.8) 
n—+oo n PEP () 


74 
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3. If Q < P, Corollary 2.4.1 implies h¥S (T, Q) < hKS (T, P). 
4. Since u is T-invariant, the conditional entropy is stationary, namely 


H, (T (P)|T = (Q)) = H, (P|Q) . 
Thus, if T is invertible (3.5) can be rewritten as 


n—1 


h#S (T, P) = lim H, (P| Vet): 
j=l 


. Let P and Q be two partitions; then, using Corollary 2.4.1, the rela- 


tion (2.91), Lemma 2.4.3, Corollary 2.4.2 and the previous remark, one 
derives 


H,(P) < Hy(PO VQ) = HQ) + HPQ) 


n-1 n-1 
<H,(O™) + XO a (PQ) < 7,.(0™) + $O HPO") 
1=0 i=0 


whence H,,(P™) < H,(Q™) + nH,,(P|Q) implies 
HPS a, Q) + H,(P|Q) . (3.9) 


Given a partition P, set Pps := Vier PJ, where r < s and r > Oif T is 
not invertible. Notice that 


then, since u is T-invariant, from 


1 n-1 maT 1 n+s—r—1 
Be V Pie) = m VP) 
n ” V Phs n nts—r " V 


it follows that i (T, Prs) = he (T,P). For instance, s = —r = n gives 


be |e P| She (T, P), Vad. (3.10) 


j=-n 


. As before, set Q = Vy Pi, k > 1; then, P < Ve OFF = a pr 


and (3.9) yield 


1 l : 
pa UP) Sn abr?) (3.11) 
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8. After regrouping viz T= (P) = Vo a T—*i-*(P), from subaddi- 
tivity and T-invariance of u it follows that 


kn-1 n— n—-1 


L(V P) sh (re (Vp) = Laer), 
j=0 i=0 pa 


j=0 


whence letting n — +00 obtains hk (T,P) < hi” (T, P). 


The entropy rate relative to a given partition P of (¥,T,,) strongly 
depends on the latter; for instance, if M is the trivial partition consisting 
only of Æ itself and the empty set, then T~/(N) = N for all j > 0, whence 
hi’ (T,N) = 0. The obvious way of achieving an absolute entropy rate is 
to look for the greatest possible one; this leads to the notion of dynamical 
entropy also known as Kolmogorov-Sinai entropy (KS -entropy) or metric 
entropy (171, 172]. 


Definition 3.1.2 (KS Entropy). The dynamical entropy of a classical dy- 
namical system (X,T, p) is defined as 


KS = KS 
hy? (T) := mpi (T,P) , 


where P is any finite, measurable partition. 


Remark 3.1.2. The dynamical entropy provides a quantity that remains 
invariant under isomorphisms between dynamical systems [61]. Two dynam- 
ical systems (412,T1,2, 41,2) with o-algebra 71,2 are isomorphic if there ex- 


et 


subsets xo C X12 of measure m (æ) = 1 and a one-to-one map 
p: xo = xO) such that 


is 


= 


if S2 = 6(S,) with $, € X, then Sı € 5; if and only if Sy € Xz and 
(S1) = u2(S2), that is u2 0 ® = py and u2 = p1 0 BT! relative to a 

: ay CT, Caen namely, the specially selected subsets x9 must be 
mapped into themselves by the dynamics; 

. 8(Tızı) = Ta8(z1), that is Tz 08 = o T, and 7! o Ta = T, 0d 
relative to A 


N 


w9 


Because of these properties, it turns out that, if (£1,2, Ti ,2, 41,2) are isomor- 
phic, then vie (Tı) = Live (T2). The proof is as follows: if x9 = X12, to 
any partition Pı of 4; there corresponds a partition Pa := (Pı) and vice 
versa, the same being true of the refined partitions p™ that are mapped 


n=1 n=1 
into partitions V o T7 (Pi) = V TZ Î(B(P1)) = PS”. The result thus 
j=0 j=0 


follows since pı pp”) = m PL? => H, (P®) = Hp (P: 
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If %9 C #12, consider a finite, measurable partition Pı = {POP of 
Xı and construct the partition Pz of Xa with atoms pO = oP? N x0, 
i = 1,2,...,p and Py = XX \ HO), Since the latter atom has measure 
(P?) = 0, from the properties of a and the isomorphism @, it turns 
out that H,,,(P§”) = Hy, (P). This gives h¥S (T2) > h¥S (T,); indeed, P: 
is a generic partition of V1, but P2 is not so for 12; the result thus follows 
by exchanging the roles of the two dynamical systems. 

Concluding, two isomorphic dynamical systems must have the same dy- 
namical entropy; since dynamical systems with the same dynamical entropy 
need not be isomorphic, the latter is not a complete invariant [61, 91]. 


Example 3.1.1. [61] Suppose the discrete-time dynamics of (¥,T, p) is 
sampled by observing the time-evolving system not at each tick of the clock, 
rather every k ticks; then 


be) = eh (T) (3.12) 


Indeed, consider Remark 3.1.1.7: since Q depends on P in a specific way, by 
varying P, one does not in general exhaust the whole class of finite measurable 
partitions of V. Then, 


nr) sup ho (e O= khi T 


On the other hand, Remark 3.1.1.3 and P < Q yield 


mn (T P] =15S Ooh Pie Sas 


The technical difficulty of computing the sup in Definition 3.1.2 is over- 
come when there does exist a generating partition P (see Definition 2.3.5) 
such that, together with its images at different times P = T~/(P), it pro- 
vides refined partitions P™ that generate the o-algebra X of X when n —> oo. 


Theorem 3.1.1 (Kolmogorov-Sinai Theorem). If the partition P is gen- 
erating for (X,T, u), then laa (T) = ke (T,P). 


Proof: Consider T invertible (for T not invertible the argument is the 
same) and a generic finite, measurable partition Q; because of the assump- 
tion, using Example 2.4.4, for any £ > 0 one can find an n > 0 and a finite 
partition P < P n,n := Vican Pi, that is a partition generated by finite 
unions of atoms of P_,,,, such that the conditional entropy H „(Q|P) <e. 
Therefore, from (3.9) and (3.10) together with Corollary 2.4.2, one derives 
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nS (7, Q) < DES (T, Pona) + Hy(QIP—nn) < BES (T,P) + Hy(Q|P) 
KS 
eh) Fe 


whence, choosing Q such that hf (T) < h,(T, Q) + ¢, it follows that 


KS KS KS 
be -est Oo) <i (7p) +e 


with £ > 0 arbitrary. 


The following corollaries are often useful for concretely computing the 
dynamical entropy. 


Corollary 3.1.1. Suppose {Pn}nen is a sequence of finite partitions for 
(X,T, u) of increasing finesse, Pn < Pn4i, such that V, Pn = X. Then, 


KS _—_ i KS 
hy? (LT) = lim by? (T, Pn) - 


Proof: Given £ > 0, let Q be a finite, measurable partition such that 
hi (T) < hi (T, Q) + £; from the assumption and Corollary 2.4.2 it follows 


that there exist n € N and Q < Phn such that 
he (T) +e shi? (T, Q) =h?? (T, Pa) + Hp (QIPa) 
< hi” (T, Pa) + Hy(Q|9) 
KS KS 
Sh EPa tesh T] +e: 


A similar argument as in the previous proof can be used to show 


Corollary 3.1.2. Given (X¥,T, u), suppose Xo is a measure algebra that gen- 
erates the o-algebra X of X. Then, 


Examples 3.1.2. 


1. Given two dynamical systems (4;,7;, pi), i = 1,2, their direct product 
(XY, x X,T, x To, 41 X H2) provides a new dynamical system (¥,T, u) 
consisting of two statistically and dynamically independent components. 
Concretely, X := 4, x % is the phase-space consisting of points x = 
(v1, £2), 1,2 E X12 and the dynamics T is such that Tx = (Tı z1, T2£2). 
Furthermore, if X1, 2 are the o-algebras of 1,2, then X remains equipped 
with the o-algebra X = X1 x Xə of measurable sets of the form S1 x Sb, 
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S12 E X42 and with the T-invariant measure on V, u = u1 X u2, defined 
by u(Sı x S2) = uı(S1)u2(S2). Then, [61] 


hs 


Hı Xba ( 


KS KS 
Tı x Tə) = hy u (Tı) + hy, u2 (To) : 


Indeed, X is generated by the measure algebra Up, » Pı x Pa where P1.2 
are generic finite, measurable partitions in V9; thus, “from Corollary 3.1.2 
and statistical independence, 


ne’ (T)= sup hS (T, Pi x P2) 


Pı X P2 


= sup his (Ti, P1) + sup hh? (T2, P2) 
Pi P2 


= he (Tı) + h (To) . 


2. Bernoulli Systems: (see Example 2.1.4) let u be a product measure such 
n—-1 


that p™ (i) = il p(i;). As seen in Example 2.3.3.1, the partition C of 
j=0 

Np into C} = {i E€ Rp : i; € {1,2,... .} is generating for the o-algebra 

of cylinders. Therefore, 


KES (To) = hRS (ToC) = jim *H,(C) 


n-1 p 
. 1 
= ~ lim s `. JI plij) S logp(i;) (i;) 
mea (J= j=l 
p 
= — J pli) log p(s) = Hy (C) . 
i=1 


3. Markov Processes: Let the measure in (2, Te, 1) be given, as in 

Example 2.4.2.2, by p\[™(i™) = plio) | [ p(s léz-1) on 2, Again, 
j=l 

the partition C of the previous example is generating. Therefore, since 


Xi- PCJ) = 1, 


hES (7) = h¥$ (T,,C) = lim 1c) 


n—-oo Th 
1 n—l1 n—1 
=- lim z 5 p(io) II plijlij-1) (plo) II logs plirlir-1)) 
MEQ” j=1 k=l 
p 
=— J pli)plli) logs (jl) - 


j=l 
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4. Ergodic Rotations: Consider the irrational rotations on the T? de- 
scribed by the triplets (T, T a0). As seen in Example 2.3.3.2, there is 
a generating partition C such that 


C e aa 
j=0 j=1 


where the last two equalities follow from the invertibility of the dynamics 
T. Then, as in Example 2.4.4, for any £ > 0, one can find an n € N and 
a partition C < Vat Tİ (C) such that H,(C|C) < e. It thus follows from 
Corollary 2.4.2 that 


H,(¢ V T-4(C)) <H,(CIC) <e, 


whence, from (3.4), hf (T) = 0. This very same argument holds for all 
reversible dynamical systems (x Ae A n) that possess a partition P which 
generates the o-algebra of ¥ as X = V6 Pi, 

5. Non-ergodic Rotations: Unlike in the previous example, there exists 
k € N such that T* = 1, the trivial dynamics with h5 (1) = 0. Then, 
from Example 3.1.1, 0 = hf (1) = hS (T*) = khiS (T). 


KS entropy and Lyapounov Exponents 


In Section 2.2, Lyapounov exponents (see Definition 2.1.2) have been in- 
troduced as indicators of hyperbolic behavior, that is of exponential sepa- 
ration of initially close trajectories. In Example 2.2.2 this has been calcu- 
lated to be log2 for the Baker map, which is isomorphic to a Bernoulli shift 
(22,T5, upg) with a balanced probability measure up; therefore, according 
to Example 3.1.2.2, the Lyapounov exponent equals the KS entropy for this 
system. 

From an informational point of view this fact can be understood as being 
due to the loss of information along the direction where distances and thus 
errors increase exponentially fast [62, 271]. It is therefore plausible to ex- 
pect that all possible expanding directions contribute with their Lyapounov 
exponents to the loss of information and thus to the KS entropy (see Re- 
mark 2.1.3.1). This is indeed the content of the following theorem [199]: 


Theorem 3.1.2 (Pesin Theorem). Let (X,T, u) be a smooth dynamical 
systems as in Remark 2.1.3.1; set A(x) := 5 AO) (x) dimW; (a); then, 
JAD (x) >0 


KS => T WI a 
BST) = f au(e) aCe) 
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When the dynamical triplet (41,7, u) is ergodic, the Lyapounov expo- 
nents, which are constants of the motion, are constant almost everywhere on 
X, whence Pesin’s equality assumes the simpler expression 


KS _ j 
be A Aan. 
FAD >O 
A particular instance of Pesin’s result applied to hyperbolic dynamical 


systems [313, 164] is provided by Example 8.2.4. 


Proposition 3.1.1. The KS entropy of the hyperbolic automorphisms of the 
torus with positive eigenvalues a*! of the matrix A is 


he (Ta) = loga . 


Standard proofs of this result can be found in [164, 279]; here, we prefer 
to defer it to Chapter 8, where it will be obtained by means of a quantum 
dynamical entropy (see Proposition 8.2.7 and Remark 8.2.4). 


3.1.1 Entropic K-systems 


In Section 2.3.1, K-systems have been defined in terms of the existence of 
a K-sequence {X }nez of nested o-subalgebras (see Definition 2.3.4) or of 
an algebraic K-sequence of nested Abelian von Neumann subalgebras (see 
Definition 2.3.6). We will now show that the algebraic characterization is 
equivalent to the following entropic properties, the link being the triviality 
of the tails of all finite partitions (see (2.73)). 


Theorem 3.1.3. /91, 216] Let (¥,T, u) be a dynamical triplet, the following 
ones are equivalent properties: 


1. there exists a K-sequence {Pn}nez based upon a finite generating parti- 
tion P (see Definition 2.3.5); 

2. Tail(Q) = N for any finite measurable partition, where N is the trivial 
partition of X; 

3. for all finite measurable partitions Q of X, 


KS 
hi (T, Q) >0; (3.13) 
4. for all finite measurable partitions Q of X 
“lim _nKS (", Q) = H,(Q) ; (3.14) 


5. for any two finite measurable partitions Q1 2, 


Jim, Hu(Qal V T-*(Q2)) = Hy(Q1) ; (3.15) 


k>n 
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6. for any two finite measurable partitions Q1 9, 


(jim  H,(1| VV T-*(Q)) SiS Q =N. (3.16) 


k>n 


From the characterization of K-mixing by the triviality of the tails of all 
their finite partitions (condition (2) above), Proposition 2.3.5 gives 


Corollary 3.1.3. A dynamical triplet (X¥,T mu) with a finite generating par- 
tition is a K-system if and only if it is K-miaing. 


The key observation in the proof of Theorem 3.1.3 is the continuity of 
the conditional probabilities as stated in Theorem 2.2.1 and the continuity 


of entropies and conditional entropies with respect to their arguments. This 
fact allows us to recast (3.4) in the more suggestive form 


nk (T,P) = lim Hy, eV») =H, PIV») , (3.17) 


where P = T~J(P). Also, by means of (2.73), in (3.15) and (3.16) one 


rewrites 


: =k = P -k 
„ip, Ha (Ql V TO) = Ha (21 „im, V T=) 
k>n k>n 
= H, (Q1|Toil (Q2)) . (3.18) 
We shall also need the following two results [91]. 


Lemma 3.1.1. Given two finite partitions Q1 2, 
+00 
H, (Q11 \/ T7” (Q1) V Tail (Q2)) = BSS (T, Qu) . (3.19) 
n=1 


Proof: As a first step, observe that, given a finite partition Q, repeatedly 
using (2.91) and (3.3) yield 


H, (VV T*(2)|\V To) = S a(o) V T-4(Q)) 
k=1 j=l k=0 j=—k4+1 
+00 
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Then, for fixed € > 0 and sufficiently large n, using Corollary 2.4.2 and 
Definition 3.1.1 one gets 


n—1 


Ha( VTi pE (Q1 V Qo) Vr Q1 V Qə)) 


1 
n 


hy? (T, Q1 V Qa) = 


i 


Li(n) 


n=l 
<in (V meves|V me) 
O 
L2(n) 
n—1 
< =H, (\ THQ: V Q2)) < hS (T, Q1 V Qa) +¢ 
k=0 
Thus „im alr) = im ea, Further, Lemma 2.91 yields 
n—-1 +00 ; 
Li(n) = al V T*(Q:)| VV TiQ v 22)) 
k=0 j=l 
Lyi1(n) 
n=l 
(VY TH (Q1) |V mos (Q2) V V T~4(Q1)) 
j=—ntl 
a T e a E 
Ly2(n) 
n-1 +00 n—1 +00 
La(n) = H,(V T*(Q:)| V T(Qx)) +H, (V T*(Qs)| V T(Qx)) 
k=0 j=l k=0 j=—-n+1 


En oona NE ħi 
Lai(n) L22(n) 
Corollary 2.4.1 implies Z11(n) < Loi(n) and Ly1(n) < Loi (n), then 


Lin) 


n— +00 N n— +0 FI 


By applying (2.91) and then the argument that led to (3.4) one gets 


Liin 
Eas 
n—1 

= lim P ("i an|V renv V T (2:)) 


r=—k+1 
n-1 


= tm Me (a Vi T (Q2) V Nee (21)) 


j=k+1 
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= im Ay laly TI (Q) V V ren) = H,,(Q1| \ Cn) 
n r=1 n>0 


N 
Cn 


+00 
< H,,(Qi|Tail(Q2) v V T-"(Q1)) < HES (7, Q1) - 
r=1 


The last equality follows from the fact that the partitions C,, (not finite in 
general) are such that Cn < Cn-1, whereas for the last but one inequality 
Corollary 2.4.2 has been used and the fact that 


+00 
Tail (Q2) V V T(O)=| A VT OI VVT Oa AG: 


n>0 k>n =l n>0 


Lemma 3.1.2. Given two finite partitions Q1 2, 


Qə < VV T” (Q1) = Tail (Q2) < Tail (Q1) . (3.20) 
nEZ 


Proof: We shall show that all partitions Q < Tail (Q>) are such that 
Q < Tail (Q1), too. Notice that Q < V„ez T” (Q1), by hypothesis. If 


H, (P|Tail (Q1) VQ) =H, (P| Tail (:)) (x) 
for all finite partitions P < \V/;__,, T*(Q1), then by approximating Q arbi- 
trarily well by V/7__,, 7*(Q1), continuity allows one to substitute Q for P in 
(x). Then, (2.91) implies 
H,,(Q|Tail (Q1) v Q) = H,,(Q|Tail (Q:)) => Q ~ Tail (Q1) . 


Equality (*) is proved as follows: a repeated use of (2.91), together with the 
T-invariance of Q (see Remark 2.3.4) and Remark 3.1.1.4, yield 


(WV me] V Trevo) = 


j=n+1 


Li(n) 
= 3 H,(T*(21)| V T-I (Q1) V Q) = 2n Hy CNES #(Q1) VQ) 


k=-n j=—k+1 k=l 


as well as 
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H,( VK | V T-i(Q1)) = 2nH, (2,|\V r 1(Q,)) 
k=1 


k=-—n =n4+1 
eee 
La(n) 


= 2n hS (T, Q1) . 
Since Q is T-invariant, it coincides with Tail (Q), whence Lemma 3.1.2 ensures 
that Lı (n) = La(n). Furthermore, since 


n n n 


P3 V T™Q)= Pv V T@)= V Ti), 


k=-n k=-n k=—n 


by using (2.91) one gets 


Ly(n) = H, (P| V TI (Q1) V Q) 
j=n+1 
———————_—_S>=SS_ 
Li1(n) 


+H,(\ T*(Q1)|P v V T(@) va) 


k=—n g=ntl 
———— [Lalal 
Lio(n) 
+00 . Foo n 
L(n) = H, (P| Vrs (Q:)) + m(V 7 *(Q1)|P v VV T~4(Q,)) 
j=n+1 k=-n j=n+1 
eee 
L2i(n) L22(n) 


Since Ly, (n) < La(n) and Ly2(n) < L9(n), Li(n) = L2(n) gives 


(P| V mevo) =H4(| V rre) 


=n+1 


for all n > 0. Therefore (see the proof of the previous lemma), 


+00 
H,(P| A (V TQ) v Q)) = A, (P| Tail (Q1)) « 
n>0 j=n 
The equality (*) thus follows from Corollary 2.4.2 and 


Tail (Qi) VQ N (V T-I (Q) V Q) so that 
n>0 j=n 
H, (P|Tail (Q1)) >H, (P|Tail (Q1) V Q) 


> H,(P| N (V T~4(Q,) v Q)) = H, (P| Tail (Q1)) - 


n>0 j=n 
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Proof of Theorem 3.1.3 The equivalences will be proved according to the 
following scheme: 


(6) 
(1) => (2): take Qı has the K-partition P, then Lemma 3.1.2 implies 
Tail (Q) < Tail (P) =M for all finite partitions Q. 
(2) => (1): this is the content of Proposition 2.3.6. 
(2) => (8): if hes (T, Q) = 0 for a finite partition Q, by means of (3.17) and 
the argument of Example 2.4.3 extended by continuity 2o non-finite contexts, 
one gets Q < V} T-"(Q). Then, T-*(Q) < V}. T” (Q), for all k > 0, 
whence 


+00 
Tail (Q)= A V re) = V ro) o> QN. 


k>0n>k 


(3) => (2): let Qz be a finite partition with Tail (Q2) # M; Lemma 3.1.2 
applied to Qı < Tail (Q2) yields WS (T, Q1) = 0, whence Qı = N. 
(2) => (5): using (3.18) one gets 


H, (Q1!Tail (Q2)) = Hy(QilW) = Hy(Q1) , 
for all finite partitions Q) 2 (see Example 2.4.3). 
. => (6 ‘ follows from (2) => (5). 
) = (2): suppose Q% is a finite partition; if Qı < Tail (Q2), (3.18) yields 
H 
( 


(Qu Tai (Q2) jj: Thus, Qı = N from (6), whence Tail (Q1) = N. 
(5) = (4): consider a finite partition Q and notice that 


V Tk aN TI” (Q). 
Then, Corollary 2.4.2, (3.17) and (3.7) imply 


„(Q)= lim H, (|V =o) 


n—=+ 


< lim H (alr m(Q)) =bKS g Q) < HQ) . 


n+ 


(4) = (3): given a finite partition Q 4 M, choose £ > 0 in such a way that 
H,,(Q) —e > 0 and n large enough to have Hes (T”, Q) > H (Q) — e€. Then, 
from (3.11) one derives 
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H,,(Q) ZE 


n 


1 n 
h,* (T, Q) > —hy* (gz >i: 


3.2 Codes and Shannon Theorems 


As seen in Section 2.4 communication channels usually comprise a preliminary 
encoding of the source signals. In the following, we shall review some basic 
facts concerning the role of entropy in this context, with particular reference 
to compression of information and its transmission through noisy channels. 


Definitions 3.2.1 (Codes). 


1. A code E : 14+ 925 for a source A with alphabet I4 = {1,2,...,a} is any 
map which associates source symbols i € i4 with strings of any lengths 
consisting of symbols x € Ix = {1,2,...,d}: 


Ta Dim Eli) = 2 = gizz- -En EM, 2; € {1,2,...,d}, 


where 925, denotes the set U,,s1 RP., 

2. A code is non-singular if any two different source symbols i,j € Ia are 
mapped into different code-words E(i) # E(j) € Q3. In this way, any 
code-word corresponds to a unique source-symbol. 

3. The extension of a code E : I4 — (27, to strings 4 = iiio te E a of 
length £ is defined by concatenation: 


RO 51 o E94) = E(k) Elin) +++ Elie) € R3. 


4. A code £ is uniquely decodable if its extensions E are non-singular. 

5. A code E is a prefix or an instantaneous code if no code-word prefixes 
another code-word, that is if no code-word consists in code-symbols added 
to a code-word. 


Examples 3.2.1. [92] 


1. Prefix-codes are uniquely decodable and uniquely decodable codes are 
non-singular. 

2. Let I4 = {1,2,3}, Ix = {0,1}; E(1) = 0, E(2) = 00, E(3) = 01 is a non- 
singular code, but not an uniquely decodable one for €(11) = €(2) = 00. 

3. The code E(1) = 0, €(2) = 01, €(3) = 11, is not a prefix-code as E(1) 
prefixes €(2). However, it is uniquely decodable for the following reason. 
Suppose Eli®) = E(j™) =a): if zı = 1 then x2 = 1 and i1 = jı = 3; 
if xı = z2 = 0, then 7, = jı = 1. Finally, if zı = 0 and z2 = 1 then 
i1 = jı = 1 when x3 = 1, otherwise ių = jı = 2. In this way every string 
of code-words encodes a unique source-word. 
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4. The code E(1) = 0, €(2) = 10, €(3) = 11 is such that no string can 
be prefix to another. Unlike in the previous one, in this case one need 
not check the next symbol in order to identify the corresponding source- 
symbol. 


Prefix-codes are particularly important because the lengths of their code- 
words satisfy the following inequality. 


Proposition 3.2.1 (Kraft’s Inequality). /92] If E : I4 — 23 is a prefiz- 
code over the alphabet Ix = {1,...,d} for a source alphabet I4 = {1,2,...,a} 
and l; denotes the length of the code-word E(t), then 


yoo <i. (3.21) 
i=1 


This inequality is known as Kraft inequality; vice versa, if a set of lengths 4i, 
i=1,2,...,a satisfies (3.21), then there exists a prefiz-code E : I4 —> QQ. 


Proof: The lengths 4; need not be all different; let them be ordered such 
that l1 < lg < +- < lm, M < a and let Nj be the number of source-symbols 
with code-words of length ¢;. Necessarily, Nı < d“, otherwise there would 
be more source-symbols than words of length ¢; that encode them and the 
code would be singular. The prefix condition means that none of the N1 code- 
words can prefix code-words of length 2, whence N; d’?—" code-words are no 
more available and non-singularity implies No < d2 — N, de~t, Continuing, 
N d-” and N; d7% cannot be used as code-words of length 3, whence 


N; < d! —N,d®—% — Nod8-® . 


Iterating the argument one gets a set of inequalities 


j-1 
N; < d“ XO Nedo , LEJ Siy 
k=1 


the last one (j = m) resulting in (3.21). Vice versa, if a set of m different 
lengths 4; satisfy the Kraft inequality, then they also satisfy the inequalities 


J a j-1 
> dea aya <1 => Nj <d” _ NO Ne ds * 
k=1 i=1 k=1 


for 1 < j < m. Therefore, the source-symbols i € I4 can always be regrouped 
into subsets I4(j), each with N; elements, such that there are sufficiently 
many code-words to construct a prefix-code [4(j) 3 it E(i) € Q3. 
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Example 3.2.2. [92] Inequality (3.21) extends to countable prefix codes. 


Indeed, any a”) = (@1,X9,.--,Ln) E€ R can be associated with the in- 

terval Ay := [0.£1£2 -En , 0.L1£2**-En + d~") C [0,1] by means of the 
n 

d-nary expansion © = 2 é =: 0.£1£2-- £n. Therefore, if a countable set 


{xi}ien of code- Py al ) € Ql with lengtls Li have the prefix property, 
the corresponding intervals A; of lengths d~“ are all disjoint and the sum 
of their lengths cannot exceed 1. Viceversa, given a countable set of lengths 
satisfying the extended Kraft inequality 


ye St, 


ieN 
these can be assigned to disjoint dyadic intervals whose left ends can be used 
as code-words of a prefix-code. 


Given a source A some codes will prove more adapted to its statistical 
properties than others; for instance, it is convenient to assign shorter code- 
words to the symbols emitted with higher probability. In this context, a useful 
parameter is the following one. 


Definition 3.2.1 (Average Code Length). /92] Let A be a source emit- 
ting symbols from the alphabet I4 with probabilities 7 = {p(i)}ier,, the av- 
erage length of a code E : I4 —> (2% is defined by Lz(E) := X; pi), 
where l; := €(E(i)) is the length of the code-word E(i) assigned to the i-th 


source-symbol. 


A way to optimize a code relative to a fixed source probability distribution 
is to try to achieve the shortest average length. If € is a prefix-code for 
which (3.21) becomes an equality, the optimal lengths are found by imposing 
that the quantity L,(E€) +A(oi_, d“ — 1) be stationary upon variation of 
the lengths and of the Lagrange multiplier A. Since S>“_, p(i) = 1, one gets 
à* = —logd and ¢* = — log, p(t), whence the corresponding average length 
equals the Shannon entropy in base d, L* = Hq(A). This is the smallest one 
achievable by a prefix code; indeed, with D := Yo d=“: < 1, by means of 
the relative entropy (2.94) and of (2.85), one estimates 


=i" = "pa ) (6; + log, p(t) = YH Joga (ni i) E ) — log, D 
i=1 
= S(T, T) — loga D >20, (3.22) 


where 7 = {d~"/D}%_,. Since 2 is not generally an integer, it cannot be 
directly used to construct an optimal code; however, set ¢; := [— loga p(i)| + 
so that & < 4; < & +1 and 


1fx] denotes the smallest integer larger than 2 € Ry 
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a z a a 
Dat Dri D1. 
i=1 i=1 i=1 


According to Proposition 3.2.1, one can thus construct a prefix-code € with 


average length L,,(€) such that 


H,(A) = L* < 


LE = 5i <I*+1=H,(A)+1. 


These upper and lower bounds also characterize the average code length 


Lr(Eopt) of any optimal code for Lr(E) > Lx (Eopt) > L*. 


Example 3.2.3 (Shannon-Fano-Elias Code). Let A be a source that 
emits symbols 7 € I4 = {1,2,...,a} with probabilities m = {p(7)}%_, and 
assume, without loss of generality that p(i) > 0. Let P(i) := X} p(y); 
then, to each symbol i there corresponds a jump from P(i — 1) to P(i) and 
the value Q(i) := P(i — 1) + p(i)/2 belonging to the corresponding step can 
be used to identify the i-th symbol. Since a code-word must contain a finite 
number of symbols, a suitable truncation of Q(z) is necessary; for this the 
binary expansion of Q(i) is used. Concretely, one assigns to the i-th symbol 
the code-word €(i) = xı (i)x2(i)--- £e (i), where 


— log, p(t) +1 < 4 := [ — log, p(t)] + 1 < — log, p(t) +2 , (3.23) 


and «,(i) € {0,1} are the binary coefficients of the expansion of Q(i) trun- 
cated at the @;-th digit: 


Si a Tl — - a 
a) =O. D H got <a + <r 
j=l j=łi+1 
QW 


Since P(i—1) < Q(i) < P(i), Q(i) provides a code-word €(i) for the symbol 
i of length 4; < — log, p(i) + 2. Also, with the notation of Example 3.2.2, the 
binary intervals 


[0.1 (é)r2(é)--- ae, (i) , 0.21 (4)x2(i)---xe,(i) +27" ] 


lie within the steps corresponding to different 2’s and are thus disjoint. Then, 
E is a prefix-code with average length satisfying 


H(A) < Ln(E) = D> pli) h = So vl) (T= loga p(i)] +1) < Ha(A) +2. 


i=l i=l 
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Remark 3.2.1. The difference between the average code-length and the en- 
tropy H(A) in the case of the assignment ¢; = [ —log, p(i)|, can be elimi- 
nated asymptotically by coding not single source-symbols but whole blocks 
of them. In this case, given a stationary source A and a prefix-code E, one 
encodes strings of length n, i”) € RP, with code-words € (i) E 2% of 
lengths es and average code-length per symbol 


Dipl = = Y pam) 
MEQM 


Then, the same argument developed for codings of single source-symbols 
yields the bounds 


Hy(A™) 


n 


(n) 
(EM) < aa A 


n 


<L 


DER ok 


Taking the limit n — oo, one sees that the average code-length per symbol 
tends to the entropy rate (in base d) ha( A) of the source (see Remark 3.1.1.1). 
This simple result motivates the following interpretation: 

The entropy rate of a stationary source represents the expected number 
of code-symbols needed to optimally describe the whole stochastic process 
corresponding to the source. 


3.2.1 Source Compression 


Storing or transmitting information consumes a certain amount of resources, 
like the number of uses of a channel or the allocation of memory. In order 
to minimize the costs, the strategy is to compress information as much as 
possible in such a way that it could be efficiently retrieved, that is with small 
probability of errors. We shall start with the case of binary Bernoulli sources 
A emitting statistically independent signals (see (2.31)). 

In such a case, the source amounts to a stochastic process {A7 }jez con- 
sisting of independent and identically distributed random variables, each with 
discrete probability distribution 74 = {p()}$_,. Then, the mean value of the 
random variable 


iL,(Aj= == » log p( AÏ) (3.24) 
j=0 


is the Shannon entropy H(A) = 5 p(i™) La(i™), while the variance 
imen 
1 
equals Vn(A) := ((Ln(A) — H(A))?) = —(log” p(A)) — H?(A). 


Lemma 3.2.1 (Tschebitcheff Inequality). Let X be a random variable 
with outcomes i = 1,2,...,d, probability 7 = {p(i)}¢_,, mean value M := 
2 


4=17 


V 
(X) and variance V := (X?) — M?, then Prob{|X — M|>e}< T 
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Proof: The upper bound follows from 


sl oe 
Prob{|X - M|>e}:= So pli< a >X pali- M}. 
iJi —M |>e i:|i— M|>e 


With X := Ln (A) as in (3.24), the previous Lemma yields 
Prob {|Ln(A) — H(A)| > €} < 


Therefore, chosen € > 0 and 6 > 0, for n sufficiently large, one can select high 
probability subsets 


n “(nN n 1 n y(n 
AM = {it ) e 2M: |-=logp™ (4) — H(A)| < e} l (3.25) 
: n 


such that 
Prob( A) >1-ô, Prob((A‘"))*) <ô, (3.26) 


where (ayy = a ee is the corresponding low probability subset. 


Proposition 3.2.2 (Asymptotic Equipartition Property (AEP)). 
For any e > 0 and 6 > 0, there exists Ne5 such that, for alln > Ne 5, the 


high probability subsets AQ C a? are such that, for all i™ € A, 
eT™”(H(A)+e) < pli™) ae. (3.27) 
while, their cardinalities HAL?) satisfy 


(1 — d)enH#A-9 < HAM) < ermtAte | (3.28) 


Proof: The first statement follows from (3.25), while the second one is a 
consequence of (3.26) and of 


1= 5 pa™) > 5 pi) > H(A) eTA) 


iM) ENa Me A™ 
tofe 5 pli™) x #(A™) eTA) -6) ; 
MEAM 


Roughly speaking, the AEP states that, for large n, the binary strings of 
length n can be subdivided into a high probability subspace AG containing 
~ e"4(A) strings each one of them occurring with probability ~ e7"? (4), 


Also, the closer the source entropy to 1, the closer A” gets to Rw. 
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For Bernoulli sources, the AEP amounts to Šip pli™) — H(A) in 
probability. In fact, the AEP extends to ergodic Sortes and, more in gen- 
eral, to symbolic modeling of ergodic dynamical systems, with the Shannon 
entropy replaced by the entropy rate. 

Let P = {P,}?_, denote a finite, measurable partition of a reversible 
ergodic triplet (¥,T, 4) and set P$ := Vier Pi, PI := T-I (P). Further, for 
any x € X let P3(x) denote the atom of the partition P£ that contains x: for 
-almost all x there is one and only one such atom. Notice that each P$ is a 
random variable on ¥ such that 


Pil) = () T(P) = Tise P, Yj=r,r+1,...,5. 
j=r 
Consider now the random variable 


ha(2) = —Ž log u(P8-*(2)) 5 (3.29) 


with the notation of Section 3.1, its expectation is 


1 = 
Mhn) = f anl) =- E f, ule) ogulPE e) 
x n p™ 
EO) CQL am 
a] my 1l (n) 
= 5 H(Pn) og (Pin) = z Hu(P E (3.30) 
EMEQ™ 
Lo, w(P§(2)) 1 0 
Rewrite hn(£) = —— 5 log a log u(P(x)), Po = P, and ob- 


nf © (Py * (2) 
serve that P(x) = P®,(T*x) and PẸ! (x) = Pry (Tx), then 


n—1 


An (a) = i 5 gx(T*x) where (3.31) 
k=0 
0 xy 
nee —iee PU). ate ee om (3.32) 


All these functions are positive; furthermore, 0 < g := lim, gz exists almost 
everywhere and is integrable. In fact, let f% := gp |P, that is 


u(Pa, (x) A P;) 
fiz) = —log MES . 
u(PZ;(2)) 
then, the conditional probability (2.52) of the random variable P conditioned 
on the measure algebra generated by P} reads 
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p(P =i 


P33) (a) =o FO | 


Since, from Theorem 2.2.1, lim, fi exists -almost everywhere, the same is 
true of g = lim, gz. Now, fix a € R and define the following disjoint subsets 
of X: 


M < 
Bam {e ggg le) a< ma) 


Fi =la max filz z) <a < fi(a)} . 


1<j<k—1 


Using the defining property (2.52) of conditional probabilities, one estimates 


Pp 
=3 Par] = al dy (x u(P = 1P) (@) 
i=1 
<e~"y(Fi) and 
love) p oo 
So u(Ex) Se" So (Ù ri) Lpi t: 
k=1 w=1 k=1 


Setting g := sup; gk and Gk := {x : k< gl£) <k+1y, 


=>, du (a <So(k+ Dem < +00 , 
k=0 Gk = 


k=0 


whence g and g are both integrable. 


Example 3.2.4. Consider the case of a bilateral Bernoulli shift as in Exam- 
ple 3.1.2.2. Then, x = 2 € Na, and, choosing as P the generating partition 
C, one gets gx(t) = go(t) = — log „eli )). Therefore, the sum in (3.31) yields 
the time-average of gọ, whence one can apply Birkhoff’s Theorem 2.3.1 and 
ergodicity to deduce that 
lim hn(i) = H,(C) = hi (T,) p- a.e. 

and that the asymptotic behavior pli”) ~ e- Phy (Tx) holds almost every- 
where and not only in probability. 


Despite the fact that, in general, the functions in (3.31) are different for 
different k’s and thus (3.31) is not a time-average as in Birkhoff’s theorem, 
none the less the following result holds. 


Theorem 3.2.1 (Shannon-Mc Millan-Breiman Theorem). 
Let (X,T, u) be a reversible, ergodic dynamical system, then, for all finite, 
measurable partitions P = {P;}?_,, 


lim hala) = he? (T, P) ae. 
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Proof: —[61, 199] With the notation introduced in the preceding discussion, 
dominated convergence, T-invariance of u and (3.30) together with (3.31) 
yield 


n-1 n-1 
ea ok ea Wh 
H(g) = lim (gn) = lim =) (ge) = lim = $ (ge o T") 

k=0 k=0 

= lim p(hn) = nKS (T, P) . 
n-1 
On the other hand, from ergodicity, saa (T,P) = w(g) = lim A ` g(T*x) 

k=0 


H — a.e., whence the theorem is proved by showing that 


n-1 
lim — —g)(T*x) =0 — a.e. 
AA g)(T* x) p—a.e. (x) 


Consider G y(x) := sUPk>n |gk(x) — g(x)|; these functions are integrable and 
limy Gy = 0 p-a.e., thus, from ergodicity, 


n—1 n—-1 


i . 1 k 
lim sup X (or — g)(T* x)| < lim sup A 5 |(g9k — g)(T* x)| 
1 n—1 
< limsup — Gy(T*x) = (Gy) 
n—oo N 
k=0 


pi-a.e. and for all N € N whence (x). 


Remark 3.2.2. The Shannon-Mc Millan-Breiman theorem applied to an er- 
godic source allows a reformulation of the AEP in terms of the KS entropy. 
Indeed, choosing as P the standard generating partition as in Example 3.2.4, 
almost everywhere convergence of p(s) to e7” A), ensures that, given 
c > 0, and 6 > 0, for n sufficiently large, the ensemble of strings of length n 
can be subdivided into a high probability subspace A” of probability ~ 1 
containing ~ e” (A) strings. 


The AEP allows the implementation of the following compression scheme 
of an ergodic binary source: one considers strings of length n, makes a list 
of those contained in a high probability subset A” and assign them as 
a code their position in the list. Since A”) contains less than 2”("(4)+«) 
strings (entropies being conveniently computed with logarithms in base 2), 
the number of bits needed for the encoding is at the most 


[log, 2442) ] 41 = [h(A) +e] +1, 
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while the strings belonging to the complementary set (Af )© may be encoded 
by a same integer, say #(A™) + 1. Upon retrieval, the strings belonging to 
A” are exactly identified by their code, but not those in (AS Je: however, 
since Prob((A\”)2) < ô and 6 — 0 with n — ov, the larger n gets, the lower 
is their probability of occurring. Therefore, the probability of error can be 
made vanishingly small by increasing n. 


Theorem 3.2.2 (Noiseless Coding Theorem). Let A be an ergodic bi- 
nary source with entropy rate h(A): binary strings of length n can be encoded 
by using nR < n bits and vanishing probability of error if R > h(A). If 
R < h(A), then the probability of error goes to 1 with n > co. 


Proof: The first part of the theorem follows from the previous discussion 
by applying the equipartition theorem with R = h(A) + €. 
For the second part, let R = h(A) — e and consider the high probability 
(n) 


subset AY together with its complement (A; 12)" The probability of any 


subset B of a containing |2""| ? strings can be estimated as follows, 
Prob(B) < Prob( B A (A\))°) + Prob(B n 463) 
2G a: gnRg—n(h(A)—«/2) = 9.4 g—ne/2 


where ô is a vanishingly small quantity given by the AEP . Thus, listing the 
strings belonging to a subset as B, one uses less than h(A) bit per bit , but, 
when n gets larger, the probability that an emitted string belong to B gets 
vanishingly small and the probability of error close to 1. 


Universal Source Codings 


The compression protocols discussed in the previous section depends on 
the knowledge of the source statistics. Interestingly, encoding and decoding 
schemes exist which work equally well, namely with a same compression rate 
R, for all ergodic sources A with an entropy rate h(A) < R, whatever their 
overall stationary probability distribution: these protocols provide universal 
source codings. 

In the following, we shall consider Bernoulli sources [92], while the general 
case can be found discussed in [325, 168]. The method used is based on the 
concept of type. 

Let A be a stationary Bernoulli source emitting strings 4 = ijiz -in € 
R, ij € IA = {1,2,...,a} according to compatible probability distribu- 


tions TA) = fapa) = Ilj- pali). We shall denote 


?|a| denotes the largest integer smaller than z € R. 
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1. by N(jjé™) the number of times j € I4 occurs in the string i”); 
N Ghi ™® 
2. by pljli™) = NE) the so-called empirical probability generated by 
n 
the string i™ and by My) = {pli the corresponding empirical 
distribution. The latter is known as the type of i™: strings i) whose 
symbols occur with same frequencies belong to a same type I); 


3. by Pn the set of all types 7”; 
4. by T(IZ™)) the subset of all strings i” € 2”) with a same type MO. 


The construction of universal codings is based on the following two 
bounds; of particular importance is the second one which states that the 
number of different types increases at most polynomially with n. 


Lemma 3.2.2. Let IT € P, be a type of R and let H(I) be its 
Shannon entropy. The number of strings in TUII™) and the number of all 
possible types fulfil 


HITT) PEE, HP) < (n+ 1)" 
Furthermore, the a-priori probability of T(II™)) is such that 
KOTO) < rS sn), 
where S(IT™ , n4) is the classical relative entropy (see (2.94)). 


Proof: Let P\ be the following empirical probability distribution on ae. 


PO (4) = To jMyNGIE™) = = JJ aie) log, p(jli™) _ g—n HUT n)) | 
j=l 


The probability of the type class T(J‘) is certainly smaller than 1; thus, 


1> PO(THT™) = E POEM) = (PUT) 2-H ™ 
MET (IT) 


yields the first estimate. 

The second is a very loose upper bound: each type I/;(n) is entirely char- 
acterized by how many times each symbol i € I4 occurs in i"). Without 
constraints (that can only decrease the number of types) there are n + 1 
choices for each i = 1,2,...,a, namely 0, 1,..., n, whence the result. 

Finally, the last bound is derived as follows: first, notice that 
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NCEE) np(lla™) logs pa (j) 
(4) 2 


j=1 e=1 e=1 


oe = 


| 
— 
3 
= 
© 
i 
= 
RS 
D 


a „(n s(n „(n AKAS] 
O yr Ei (pa logs p(ele™) — p(€lé™) logs ate) | 


= 97n H ny) + SUL n) 7 4)) 


Then, using the first upper bound, 
mC) = So pa) 
METI) 


= #(T(™)) g-m(HUT™) + SUE ,2a)) < gon SE yea) 


Because of the first bound in Lemma 3.2.2, at most nR+1 bits are needed 
to encode the label of a string i'” of type ™ with H(II™) < R, while at 
most alog,(n + 1) + 1 bits ensures the encoding of the label specifying the 
type P to which the string belongs (the +1 accounts for R and log,(n+1) not 
being integers). Therefore, in the limit n — oo, one expects a compression 
rate R for all Bernoulli sources with H(A) < R. 


Definition 3.2.2 (Universal Codings). Let R > 0 and consider an en- 
coding of a Bernoulli source A into binary strings of length |nR|, given by 


EPs QA” > RPP, followed by a decoding procedure D : ink > R., 
This gives a universal (n,2”®)-code if the probability of error 


PO) = 70 (a : D” o E” (i™) Z sm) 


goes to 0 when n — œ and E€”, D” do not depend on the Bernoulli source 
probability TA. 


Proposition 3.2.3. There exist universal source codings (n, 2") for every 
Bernoulli source with H(A) < R. 


logs(n + 1) 


Proof: Given R > 0, let Rn := R—a ; using the first two bounds 


in Lemma 3.2.2, the subsets A™ := {i € 2M : Hg) < Rn} have 
cardinalities such that 


O) EPpy A EPn 
H(A) )<Rn HUI) <Ry 
< X gn Rn < (n+ i)? gnRn = gnk . 
O) EPn 


H(I(™)<Rn, 
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Let E” associate to strings in A their label in the list of such strings 
expressed in bits and let D” be its inverse map. If H(A) < R, then, using 
the third bound in the previous lemma, 


pm =1-7® (A) = Sm (rr) 


AM) EPn 
H(I ))>Rn 


< (n+ 1)° max{ a (rar) : H(I) > Rn} 


-nming S(™ na): H(I rea} 
-1)°2 { ( A) ( )> 


Since lim, Rn = R and H(A) < R, and the relative entropy S(m1, 72) = 0 


iff mı = 72, Po... gets exponentially small for n sufficiently large. 


3.2.2 Channel Capacity 


Noiseless channels are an exception; usually, during transmission signals get 
distorted. It can thus happen that a channel outputs a same string y™ 
when presented with different input strings xf” and a” which cannot then 
be decoded without errors. Like in compression, to counteract distortion one 
resorts to suitable encoding and decoding procedures of longer and longer 
strings; however, unlike in compression where redundancies are eliminated, 
in the presence of noise, the strategy is to introduce redundancies in order to 
lower the possibility that different input strings give rise to a same channel 
output. 


Example 3.2.5. In Example 2.4.1, bits 0 and 1 can be converted into one 
another with probability 0 < p < 1/2 by a binary symmetric channel C. The 
probability of a wrong decoding can be lowered by encoding 


€(0)= 00-0 , E(1)= 11-1 
Sa ow SHY 


2n+1 times 2n+1 times 


Then, 2n +1 uses of the channel output strings ¢@"T) := C2@"+) o €(i) that 
can be decoded by a majority rule: let Nj 2n+1)(0) denote the number of 0s 
in ¿”+D then 


“(Qn+1)\ _ 0 if Njen+y (0) >n 
De =i if Nzenty(0) <n 


By such an encoding-decoding procedure one transmits one bit at the cost 
of 2n + 1 bits; an error occurs, that is D o C@"+) o €(i) 4i if >n+1 
bits of E(i) are flipped by the channel C. The probability of such an event, 
H +1 


i )ora—py vanishes with n — oo; unfortunately, the transmission 
n 


3.2 Codes and Shannon Theorems 99 


1 
rate, that is the number of bits transmitted per use of the channel, mT 
n 
vanishes, too. 


In the following we shall consider channels C without memory and without 
feedback such that each of their uses is independent of the previous inputs 
and outputs. Further, n uses of the channel C amount to a single use of a 
channel C™ which maps input strings x™ € IX consisting of n symbols from 
an alphabet Ix = {1,2,...,nx} into output strings y™ € I? consisting of 
symbols from an alphabet Iy = {1,2,...,ny}. Input and output strings are 
conveniently described as realizations of stochastic processes {X;}ien and 
{Yi len with join random variables X™ := Vi] X; and Y™ := Via Yj- 
The transitions 2) œ> y occur with probabilities p(y |a™) that fac- 
torize (see (2.82)) and are thus completely characterized by the single-use 
transition probabilities p(y;|xi). 

One of the great achievements of early information theory was obtained 
by Shannon who proved that codes exist such that the number M(n) of 
distinguishable strings «) increases with n at a non-zero exponential rate 
R: M(n) = 22", 


Definition 3.2.3 (Channel Codes and Capacity). /92/ 
A code (M,n), for a channel C consists of 


1. a set Ig := {1,2,...M}; 

2. an encoding E : Ig ++ I® associating a code-word x) (w) = E(w) to any 
of the indices w € Ic; 

3. a decoding procedure D : I} ++ Ic, D(y™ (w)) =: ù € Ic, that returns 
Ô € Ic given a channel output y™(w) = C"(a™(w)). 


log, M 
2 


The rate of the code is defined by R := 3. The probability of an error, 


Ô = Diy (w)) + w, is 


en(w) = 5 p(y a” (w)) . 


y ERL: Dy Zw 


The rate is said achievable if there exists a sequence of codes (2"®,n) with 
vanishing maximal probability of error En := MaXwerc En(w). 
The capacity C of the channel C is the largest of its achievable rates. 


Remark 3.2.3. For a memoryless channel, (2.82) holds; thus, if the probabil- 
ities of the input stochastic process {X;};cn factorize, so do the probabilities 
of the output stochastic process {Y; bien: 


3For sake of simplicity, in the following M = 2”” will be identified with [2"”], 
the smallest integer larger than M. 


100 3 Dynamical Entropy and Information 


prow) = X (ye) pwa) 


a) ere 


= IID yj|æj) px (x5) TL yj) - (3.33) 


=1 2; 


Shannon’s result is that the mutual information (2.93) I(X;Y) is an achiev- 
able rate and that the channel capacity is given by 


C = max I(X;Y). (3.34) 


Examples 3.2.6. 


1. Example 2.4.1.1: pg (i) = pa (i), i = 0, 1, implies I(A; B) = H(A), whence 
capacity, C = 1, is attained at m4 = {1/2, 1/2}. 
2. Example 2.4.1.2: with H(p) := —plog, p — (1 — p) loga (1 — p), 


I(A; B) = H(B) )+ Dora (i) XC p(jla) logs pili) = H(B) — H(p) , 
whence capacity C = 1 — H(p) is attained at 74 = {1/2,1/2}, since 


pB(0) = pa(0)(1—p)+pa(1)p pa(1) = pa(0)p+pa(1)(1—p) = = 


_ 1 

=i 

3. Example 2.4.1.3: pp(1) = pa(0)(1—a), pg(2) = pa(1)(1—a) and pp(3) = 
a(pa(0) + pa(1)) = a yield 


1(A; B) = H(B) — (pa(0) + pa(1)) H(a) = H(B) — H(a) 
=(1-a)H(A), 


whence capacity C = (1 — a) is attained at m4 = {1/2,1/2}. 

4. The capacity in (3.2.3) refers to only one use of the channel C; consider 
now the channel C™ acting on w") e I} with outputs y™® € I}. The 
mutual information [(X‘"); Y) of the corresponding random variables 
X™ and Y can be controlled by repeatedly using (2.93). From (2.82), 


H(Y™|X™) = H(X v Y™®) — H(X™) 
= H(Y,|X™ v Y®Ð) + ol vr). H(X™) 
=X A(Y;|X™ v YUD) = HexD) 
j=l 


Further, from (2.88), 
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I(x®™; y™) = H(y™) = H(Y™|x™) 


<P (HY)- HYX)) <nC. (835) 


Therefore, if C(”) denotes the capacity of the channel C™), the supremum 

over all input probability distributions gets C(™ < nC. Actually, from 

Remark 3.2.3, equality is achieved by choosing a factorizing Tyn) such 
n 


that pyiny(a™) = [[2x(), with mx the one achieving capacity C. 
j=l 
Then, the output probabilities factorize too and thus H(Y(")) = nH(Y). 


The above relation between capacity and mutual information can be un- 
derstood as follows. As showed in the last example, if X ") consists of n 
independent, identically distributed repetitions of X, then the same is true 
of Y™ and X™VvY() with respect to Y and X VY. With H(X), H(Y) and 
H(X,Y) the corresponding entropies, based on the AEP , for large n there are 
roughly 2°" (*) my-typical inputs, 2”# Y) zy-typical outputs and 2”7(*,Y) 
jointly typical pairs (a, y™), that is typical with respect to txyyy. Of 
course, not all input-output pairs (a, y(™) with a mx-typical and y™ 
my-typical are jointly typical: this happens with probability roughly equal to 

9nH (X,Y) 


ee g-nl(XiY) 
QnH(X) 9nA(Y) ` 


Therefore, in order to encounter a jointly typical pair with fixed output y™ 
one needs at least 2"!(**") inputs; in other words, one expects that encoding a 
number of input strings smaller than 2"7(*;¥), none of them should be jointly 
typical with respect to a same y. Vice versa, more than 2”/(*:¥) inputs 
would start having a same jointly typical output and thus being not exactly 
identifiable. Memoryless channels with independent, identically distributed 
inputs are thus expected to have achievable rates R ~ I(X;Y). 

In order to give a mathematical proof of the above intuitive argument, 
we start by extending the notion of typical strings. 


Definition 3.2.4 (Jointly-typical Strings). 


Two strings x™ € IX and y™Me I% are jointly typical if they belong to 
the subset A!” C I% x [¢ such that 


1 

|= logy px» (a!) = H(X)| <E 
1 

|= log py» (y™) — H(Y)| < € 
1 

|-> logs pxeva (el, y) a A(X vY)| LE; 


whereOQ<e< 1. 
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Since (2.82) holds, the argument of the proof of Proposition 3.2.2 gives 
rise to a jointly typical AEP. Namely, let € > 0, for sufficiently large n’s the 
probability carried by subsets of strings violating any of the inequalities in the 
previous definition can be made smaller than €/3 so that Prob( m) >1—e. 
Moreover, its cardinality fulfils 


(1 _ 6) gn(H(X,Y)—e) < #(AM™) < on(H(XVY)+e) , (3.36) 


while the probabilities of strings 2, y™ and (#™,y™) satisfying the 
inequalities in Definition 3.2.4 fulfil 


g-MHX)+8) < pyn(g™) < 27H X)-0) (3.37) 
HY) < pyn(y™) < 2H Y)-0) (3.38) 


Q-MH(XVY) +6) < Pen (a, y) L ce a0 =A. (330) 
Then, Prob({(2™, y) € AmI) = 5 px» (a) pyn(y™) 


(x) yr) eA 
can be bounded from below and above as follows: 


(l-e) g—n(L(X;Y)+3e) < Prob{ { (£, y) = Am) < g—n(l(XsY)—3e) — 
(3.40) 


Theorem 3.2.3 (Shannon Noisy-Channel Theorem). 
All rates R < C, C as in (3.84), are achievable and any sequence of codes 
(nR,n) with the maximal error probability en > 0 must have R < C. 


Proof that e,, —> 0 > R < C : Suppose the signals w € {1,2,...,M}, 
M = [2"F], encoded by (nR,n) into E(w) = a2 e€ I®, are equidistributed; 
let W denote the random variable with outcomes w. Using (2.93), (2.95) with 
C(B) = €(W) = X™ and (3.35), it follows that 


nR < log, M = H(W) = H(wiy™) af r(w; mn) 


< H(wiy™) + I(x; y) < H(wiy™) +nc. 


We need now connect H (wiv) to the error probability: this is done by 


means of the so-called Fano’s inequality. By assumption the maximal error 
probability in Definition 3.2.3 goes to zero with n, so does the average error 
0 w=w 
= — Ea 
M ae wAw 


variable determined by W and Y), Thus, using 2.91, 


1 
probability ef" := en(w). Let E := { ; E is a random 
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H(wiy™) = (zw, Y ®) +H(wiy) 
=H(wie,y™) + H(z). 


Now, from Remark 2.4.3.4, H(z|y) < H(E) < 1. Further, E = 0 implies 


that W is determined by Y™ so that H(W|E = o, y) = 0, whereas if 
E = 1 then the cardinality of possible values of W is M — 1. Therefore, 


H(WIE, yO) = Py Prob(£ NH (WIE = =i ym) 
= en log, (M = 1) < eg” nR => H(wiy®) <1+ e2” nR. 


The result follows since nR < 1 + e&) nR + nC implies ek”) Sl =a = g 
) nua 


which in turn implies that es” cannot vanish with n — oo if R > C. 


The proof of the first part of Theorem 3.2.3 relies on the following steps: 


fa 


. for w € {1,2,..., M = 2"), choose the code-word a) (w) at random 


with probability p{” (æ) = IIi- Pin(xi). This gives a random code of 


M n 
type (nR, n) with overall probability Prob(€ =] II Pin(ai(w 


w=li=l 
2. choose the symbols w at random with the same probability p(w) = M1; 
3. if C™ (x) = y™ and there is only one ù such that E(w) = a (w) is 
jointly typical with y“, then associate with y the symbol w, otherwise 
declare an error. This gives a decoding map y\”) + D(y™) = @; 
4. an error is also declared if D(y™) = @ 4 w and C"(E(w)) = y™. 


Proof that (nR,n) is achievable when R<C: oe ee Nw ) be the 


probability of an error relative to a random code £ and e\ YE ) the corre- 
sponding average error probability. Further, let 


= >. Prob(£) e™(E n 5 Prob(E Jek (w) : 
E 


w=1 € 
this is the average error probability over all randomly generated codes. Then, 
every w gives the same contribution to the error, so P(e) = $e Prob(€ ye (1) 
with fixed w = 1. Let Fy := {(2™(w), y™) e Ay}, where A‘) is a jointly- 
typical subspace. According to the rules of the game, if y° = C” (x™(1)), 


a decoding error occurs when 


1. (a&™(1),y™) ¢ A‘), that is when the input corresponding to w = 1 
and the relative output are not jointly typical; 
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2. (a (i), y™) € F; for i Æ 1, that is when the output corresponding to 
w = 1 is jointly-typical with code-words associated to w Æ 1. 


The overall average error probability can thus be estimated as follows: 


P(e) = Prob ((Fi) (F,)° ULF i) < Prob(( Z Pror 


i=2 


By the jointly-typical AEP , Fy C AM => Prob((F1)°) < e for n large 
enough. Further, because of fandomie of the code, the y x) (i), i #1, 
are statistically independent from a‘")(1) and y™ = C” (æ) (1)). Then, the 
jointly-typical AEP also yields 


Y Pravin (M _ 1) 2 n(I(X;Y)—3e) < 2 n(I(X;Y)—R-—3e) . 


If R < I(X;Y) — 3e, the latter quantity gets < e for n sufficiently large 
and thus P(e) < 2e. This implies that there exists at least one code €* with 
el) < 2e. By choosing for X the distribution 7* attaining capacity in (3.34), 
the condition for achieving the rate R becomes R < C. Finally, at least half 
of the code-words a ")(w) of €* must have e™ (w) < 4e otherwise el”) > 2e. 
Keeping only these ones, changes the rate from R to R(n) := R — 1/n. The 
procedure thus yields a sequence of codes (nR(n),n) such that e( — 0 and 
R(n) > R for al R < C. 


Bibliographyical Notes 


Most of the results about the KS entropy have been drawn from [61]; other 
excellent books on the subject and its applications are [17, 91, 164]. In par- 
ticular, the completeness of the KS entropy for Bernoulli systems is discussed 
in [91]. The book by [199] is a reference for Pesin’s theory and the relations 
between the KS entropy and Lyapounov exponents (see also [106]). 

In [62] one finds an extensive review of the role of KS entropy and Lya- 
pounov exponents as regards the issue of predictability in continuous and 
discrete dynamical systems. In [18], the notion is discussed in relation to the 
broader notion of complexity. 

The material on coding and compression has largely been drawn from [92, 
254, 266). 


4 Algorithmic Complexity 


One of the intuitive notions which is most elusive from a mathematical point 
of view is that of randomness. Consider a string i”) € R3 emitted by a 
Bernoulli source with probabilities po4; suppose that n >> 1 and that the 
number of 0s, (0), is nearly half the number of 1s, n(1) ~ 2n(0). One expects 
that, generically, the relative frequencies n(i)/n tend to the probabilities p; 
with increasing n; indeed, only special, that is intuitively non-random, strings 
should fail such a statistical test. Therefore, one would call i™ random only 
if pp = 1/3 [305]. Of course, passing the frequency test is not enough; indeed, 
if pp = 1/2, both i consisting of n/2 subsequent pairs 0, 1 and a string j™ 
of Os and 1s distributed without any evident pattern occur with probability 
27”, However, because of its regularity, i™ would be called non-random and, 
vice versa, because of the absence of regular structures, j™ would be called 
random [92, 310]. 

Presence and absence of patterns seems to be a useful clue to defining 
which strings or sequences are random and which are not so; this property 
should somehow be related to the degree of compressibility so that one might 
wonder whether the entropy rate introduced in Section 3 could provide a 
natural measure of randomness. Also, by replacing the entropy rate with the 
dynamical KS entropy, one could define a classical dynamical system to be 
random or not on the basis of the compressibility of the best ones amongst 
its symbolic models. However, entropy rate and the KS entropy describe the 
average behavior of sources or of dynamical systems and say nothing about 
individual strings or individual trajectories. 

Various attempts have been undertaken to tackle the problem of formal- 
izing the intuitive notion of randomness of individual sequences 2 € Nə. 
In [305], three relevant approaches are discussed: in the first one, randomness 
is identified with stochasticness, that is with the impossibility of devising a 
winning strategy when bets on the value of the next symbol in of 4 € 3 are 
based on the knowledge of iiiz ---in—1. In the second approach, randomness 
is identified with chaoticness that is with the absence of regular patterns in 
i € MQ. In the third approach, randomness in a sequence 2 € Nə is identi- 


F. Benatti, Dynamics, Information and Complexity in Quantum Systems, 105 
Theoretical and Mathematical Physics, DOI 10.1007/978-1-4020-9306-7_4, 
© Springer Science+Business Media B.V. 2009 


106 4 Algorithmic Complexity 


fied with its typicalness, that is with the fact that it does not belong to any 
effectively null subset of Q3 1 

In the following we shall focus on the second approach which is also 
known as algorithmic complexity theory, and was developed independently 
and almost at the same time by Kolmogorov [173, 174], Chaitin [77] and 
Solomonoff [283, 284] in the early sixties. Algorithmic complexity theory 
involves as many subjects as mathematics, logics, computer science and 
physics [310, 73, 254]: we shall give a short overview of some of its aspects that 
are relevant for an extension of this notion to quantum dynamical systems. 


4.1 Effective Descriptions 


The main step towards a theory of randomness of individual strings was the 
observation that regular strings admit short effective descriptions, whereas 
irregular strings do not. By effective description of a (binary) target string 
it is meant any algorithm (binary program) that is computed by a suitable 
computer and makes it halt with the target string as output. 


Example 4.1.1. Any string i” = i,i2-+-in consisting of n bits can always 
be reproduced by processing the program 


PRINT iiz-in, 


which specifies the bits to print, one after the other. 

This program amounts to the literal transcription of the target string. 
Clearly, one has to seek more clever ways to describe i”), that is shorter 
programs. In doing so, one is much helped by the presence of patterns; if 
ij = 0 for all 1 < j < n, the following simple program could be used: 


1 Let 923, the set of all binary sequences, be equipped with the o-algebra gener- 
ated by cylinder sets and with the uniform product probability distribution so that 
any cylinder C; indexed by a string i € 23 of length length (i) has probability 
n(Ci) = 2-*., A subset A C Q3 is a null subset if for any £ > 0 there are cylinders 
Ci, ij € 22 such that A C U, Ci, and Z; 274G) < e, A subset A C Rž is an 
effectively null subset if the previous inequality is satisfied with the strings 2; that 
index the cylinders and £ > 0 (any rational number) both effectively computable 
by a suitable algorithm (for instance by a program processed by a computer) [305]. 
Intuitively, random sequences cannot be effectively reproducible and thus cannot 
belong to effectively null sets. Concretely, these latter sets consist of non-typical 
strings and correspond to effective statistical tests or Martin-Lof tests that, when 
failed, identify these non-random strings (an example is the frequency test men- 
tioned in the discussion prior to this remark) [310]. In other words, a sequence is 
random according to the typicalness criterion if it passes all Martin-Lof tests. On 
the other hand, if typicalness were defined with reference to all possible null sub- 
sets, then there would be no typical sequences; indeed, any i € {22 belongs to the 
null subset of (22 consisting of the sequence itself. 
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PRINT 0 n TIMES. 


For large n, the length of such a program goes as logs n, that is as the number 
of bits necessary to specify the length of the string ¢(i) = n. This is also 
the case if, less trivially, the string i™ consists of a same pattern, i that 
repeats itself ~ n/q times. Indeed, what is to be specified is the length of 
the pattern at the cost of a fixed number, log, q, of bits and the number of 
repetitions at the cost of ~ log, n/q ~ logs n bits for n > q. 

On the other hand, if ¿™ shows no pattern, there is no shorter effective 
description than literal transcription. In this case, the length of the effective 
description grows as n and not as logs n. 


In the previous example, it is clear that one is interested in the shortest 
possible effective descriptions s(i‘”)) of a given string i: let C(i™) denote 
the length of any of these shortest description, that is 0(s(é™)) = C(é™). 

The map i’ + s(i™) is code for the ensemble of strings of length n. 
In Section 4.3, it will be showed that, by processing the effective descriptions 
by means of particular computing devices called prefiz machines (in which 
case C(i\”) is denoted by K(i\”))), the code becomes a prefix code (see 
Definition 3.2.1), so that the extended Kraft inequality (see Example 3.2.2) 


applies 
So et. (4.1) 
iER3 


Example 4.1.2 (Payoff Functions). [120,310] Suppose the government 
of a country claims that in the j-th one of n successive elections it won with 
0.997; percent of the votes, ij being any decimal digit for j odd and the 
j/2 digit in the decimal expansion of m for j even. To defend itself from 
the accuse of fabricating the electoral results, the government replies that 
the probability Q(i™) = 107” of such a string of decimal digits i™ = 
iii2 +*+ in is equal to that of any other string randomly obtained according 
to the uniform probability distribution over 10 symbols. This defense can be 
defeated by using the regularity of i\” to construct a suitable payoff function 
t(i™|Q) > 0, namely a non-negative function whose mean value is such that 


S| wma" |Q) <1. 


EME QM 


Its meaning is as follows: the accuser proposes the government to be payed 
t(i™ |Q) upon betting 1 on the outcome i). This is a fair proposal for, if 
the outcomes é) are distributed according to the uniform probability Q, the 
accuser average gain cannot be higher than 1. 

However, if there is a pattern in i, the accuser can construct a payoff 
function t(i™ |Q) that assumes high values on the strings with such a pat- 
tern. Concretely, for the half of the decimal digits of i” that are randomly 
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distributed according to Q, one needs n/2 log, 10 bits for its description; in- 
stead, for the remaining half that comes from an algorithm that computes 
successive approximations to 7, a finite number, C, of bits ? suffice. Then, 
one gets the following upper bound to the length of the shortest effective 
description computed by a prefix machine (see previous remark), 


K(a™) < 5 log, ae, 


Setting t(i™ |Q) := 27 182 Q@™)—KE™) L 107 27KE™) one defines a payoff 
function; indeed, because of (4.1), 


5 Q(i™) 271982 Qi) —KG™) _ 5 g-KG™) oe 


MEQ EM EQM 


While any fair Casino’s owner should accept bets based on such a payoff 
function, the government cannot; indeed, by betting 1 on the digit of each 
one of n successive elections, the accuser will pay n to the government but 
receive 10”/?2-© from it, quite an amount of money for large n. As the 
payoff function does depend only on the presence of a pattern, but not on its 
particular form, the accuser strategy does not require any a priori knowledge. 


The aim of algorithmic complexity theory is an objective characterization 
of the randomness of individual strings in terms of the lengths of their shortest 
effective descriptions. It is thus necessary to eliminate the dependence of such 
lengths on the computers that process the corresponding programs. Indeed, 
given a same target string i™ two different computers %1,2 will in a 
provide shortest descriptions sı,2(i ()) with different lengths C1 (a )), As 
explained in Proposition 4.1.1, this problem is overcome by resorting to ef- 
fective descriptions processed by universal computers, namely by computers 
that are able to simulate the action of any other computing machine. The 
universal computers on which classical algorithmic complexity theory is based 
are the so-called Universal Turing Machines (UTMs ). 


4.1.1 Classical Turing Machines 


A Turing Machine (TM ) is a very basic (and abstract) model of computing 
device (see [310]) consisting of 


1. a bi-infinite tape T subdivided into cells labeled by integers i € Z, each 
cell containing either a blank symbol # or a symbol o from a given 
alphabet X. We shall set X = X U #; 


?This number becomes negligible when n increases. 
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2. areading/write head H moving along the tape that, when positioned on 
the i-th cell, reads the symbol g; € X, leaves it unchanged or changes it 
into o; € X and then proceeds to either the cell i + 1 to the right (R) or 
to the cell i — 1 to the left (L); 

3. a central processing unit C (CPU) capable of a finite number of control 
states qi E€ Q := {q0, 42 ---, 4 Q|-1}: at each computational step, the CPU 
state q E€ Q may remain the same or change into q’ € Q. 


The list of possible moves defines a program for the TM ; formally, it 
amounts to a transition function 


6:Q@xVReQxUx{L,R}, d(¢,0)=(d,0',d), de {L,R}. (4.2) 


As a consequence, any TM can be identified by the set of rules defining 6. Each 
set of rules, that is any TM , corresponds to a certain task, a computation, 
to be performed on an input data string. Any computation can be assumed 
to start with the CPU control state in a chosen ready state q,, the head 
positioned on a chosen 0-th cell and the input written on a finite number 
of cells extending from the 0-th one to its left, while all other cells to the 
left and to the right contain blank symbols. The computation then proceeds 
through a sequence of steps dictated by the transition function 6, each one of 
them corresponding to a certain configuration of the TM that performs it. 


Definition 4.1.1 (TM configurations). At each step of a computation 
a classical configuration c of a TM Y is a triplet 


C>c:= (a {oi}iez:k) EQXIŽXZ, 


where in the infinite sequence {o;}iez of cell symbols only finitely many of 
them are such that o; 4 #, while q,k denote the state of the control unit and 
of the head position and C the set of all configurations. 


In order to determine when a computation terminates, we assume that 
among the control states there is a special state, qf, such that when the 
control unit is in the state qf, then the output is read off from the position 
of the head to its right until the last o; 4 #. 

Because they consist of a finite set of rules involving finite sets of symbols, 
transition functions (and thus TMs ) can be encoded and numbered. Given a 
program p (or the TM which computes it), its number +(p) in the enumeration 
of all programs (or TMs ) is known as Gédel number of p [93]. A universal 
Turing machine is any TM U which, upon receiving the code of a TM %, is 
able to simulate U on any input string. 


Example 4.1.3. There are many possible ways to encode a transition func- 
tion 0; a simple one is as follows [128]: the control states q; and the symbols 
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gj are identified by giving their positions 7, j in the respective lists Q and X. 
These are then encoded as strings of as many 0’s: 


qi= 0 := 00-0, a0 :=00-:-0. 
—S —’” a 
i times j times 


Thus, the rule 6(q:,0;) = (qk, ce, d) can be encoded as 0'10/ 10% 10°10", 
where the 1s are used to separate the various entries (only sequences of 0s 
are entries corresponding to labels). These appear one after the other as they 
do in the given rule, while n(d) = 1 if d = L, n(d) = 2 if d = R. Then, 
the transition function (or, equivalently, the TM Y that performs the task 
specified by it) can be encoded as 


1 0!@l 11 o!#1 11 0% 107108 104%107%) 11 
—— or_ 
ist rule 
0 1072102 101074) 11 
— —_—$<$<——_— - 


2nd rule 


om 10» 10%» 10% i or 111, 
a 
last rule 


where the first two strings of Os encode the total number of control states 
and of symbols, the pairs of 1s separate the rules, while the first and last 1 
mark the beginning and the end of the list. 


Suppose f : N — N is a function from the integers to the integers; by 
passing to the binary representation of n € N, f becomes a function from 
25 +> 3. It is called total if its domain of definition is the whole of (25 
(symbolically, f(é”) | for all i € Q%), partial otherwise, namely if there 
exist strings i” on which f is not defined (symbolically, f(é”) f on these 
strings). The existence of an algorithm or an effective procedure which allows 
one to compute f provides an intuitive and informal definition of computable 
functions; among others, a possible formalization of computability is as fol- 
lows [93]. 


Definition 4.1.2. A partial function f : Q3 ++ Q3 is said to be computable 
if there is a Turing machine that on input i E€ 25 outputs f(t). 


The so-called Church-Turing thesis asserts that the intuitively and in- 
formally defined set of computable functions coincides with those that are 
computable according to the previous definition [93, 128]. It is not a theo- 
rem, yet it could not be disproved as a conjecture; therefore, it is commonly 
accepted that the TMs provide a computational model which computes all 
what can be thought of being intuitively computable. 
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Remark 4.1.1. Given a computable partial function f, if pf is one of the 
(infinitely many) programs which compute it, one can assign f the Gödel 
number (ps) of pr which is one of the (infinitely) many Gödel numbers of 
f [93]. It follows that the computable functions form a countable set; this fact 
allows the use of Cantor’s diagonal argument to construct a total function 
f: 23 > Q3 which is not computable. In order to show this, consider the 
enumeration as ¢; : N + N of all computable partial functions f : N+ N 
that can be constructed by choosing a definite Godel number for each one of 
them. Then, the function defined by 


— fon(n)+1 if O(n) 1 
M= {AET E Sry 


is total as @ | on all inputs. Furthermore, it cannot coincide with any @; for, 
if ġ; is defined on j, then ¢(j) = i(i) +1 4 lj). 


Example 4.1.4. An important class of TMs are the Probabilistic TMs 
(PTMs ) which provide a more powerful classical model of computation than 
TMs [128]. They are defined by transition functions of the form 


6:QxUxQx Tx {L,R} [0,1] (4.3) 


(q,0;q,0',d)++ d(q,o;q',0',d)€ [0,1], XO d(g,o;d',0',d)=1. (4.4) 


q',a',d 


Namely, PTMs are defined by assigning the probabilities 0(q,0;q',0', d) with 
which the machine goes from a CPU control state q E€ Q and symbol read 
a € X to a new control state q’, new symbol g’ together with a subsequent 
head move d € {L, R}. Therefore, given a starting configuration c; E€ C the 
machine will move to a new configuration c; € C with a certain transition 
probability pij := p(c; — cj), the successors of c; being all those cj with 
Pij # 0. The transition probabilities satisfy ` jPij = 1; indeed, given a 
starting configuration c;, the PTM will surely move to a subsequent one 
among those available to it. Each step performed by a PTM will then be 
described by a transition matriz m = [pij]. 

Any computation performed by a PTM on an initial configuration cg 
can be seen as a tree whose nodes are the successor configurations and the 
branches connecting the leaves carry the relative non-zero transition proba- 
bilities. Each run of the machine defines a tree-level with its corresponding 
nodes; if a successor configuration at level 7 appears more than once then the 
probability of its occurrence at that level is the sum of the probabilities lead- 
ing to it through the various branches. As a simple instance of such a mech- 
anism [128], consider an initial configuration co branching into two different 
configurations cj; and cj at level 1 with probabilities po, := p(co > c11) 
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and poz := p(co —> C12): por + poz = 1. During the second step of the com- 
putation, the two configurations at level 1 branch into two configurations 
each: c11 into c21 and c22 with probabilities pı := p(ci1 —> c21), respectively 
pig := p(c11 > C22), such that pi; + pig = 1, while c12 branches into c23 and 
C24 with probabilities p23 := p(ciz2 — C23), respectively pos := p(ci2 > c24), 
such that p23 + p24 = 1 (see Figure 4.1). Thus the probabilities of the four 
configurations are 


p(ca1) = Pol P11 ; p(c22) = P01 P12 , p(c23) = P02 P23 , p(C24) = Po2 P24 - 


If c22 = c23 = c* then the probability of c* is p(c*) = p(c22) + p(c23). 


Co 


C21 C22 C23 C24 


Fig. 4.1. Probabilistic Turing Machines: Level Tree 


Remark 4.1.2. Within the class of PTMs , TMs are deterministic in the 
sense that the corresponding probabilities 6(q,0;q',0’,d) equal 1 when the 
couples (q, o) and triplets (q’, a’, d) are connected by the rules (4.2), otherwise 
d(q,0;q',0',d) = 0. The computations performed by TMs correspond to de- 
terministic classical processes, while those of PTMs correspond to stochastic 
classical processes (compare the ballistic and Brownian computers discussed 
in [51]); in other words, it is the laws of classical physics upon which the 
models of computations embodied by TMs and PTMs are based. 

PTMs are important from the point of view of the so-called computational 
complexity 3 [128, 165]. All computational tasks need a certain amount of 
time to be performed and use a certain amount of memory (space); roughly 
speaking, computational complexity theory estimates how the amount of time 
and/or space required to perform a computation involving n bits scales with 
n: if the time required to process n bits goes as n°, a > 0, one says that the 
computation has polynomial computational complexity, otherwise superpoly- 
nomial or exponential. When a computer 4 simulates another computer Y 
that performs a certain task, there is an unavoidable overhead in space/time 


3To be distinguished from the descriptional complexity. 
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resources due to the simulation. The latter is then called efficient if the over- 
head scales polynomially with respect to the space/time resources used by 
V. The Classical Strong Church-Turing Thesis [165] states that: 


Any realistic computational model can be efficiently simulated by a PTM . 


Namely, any computational model which is consistent with the laws of 
classical physics and which accounts for all necessary computational re- 
sources * only requires a polynomial space/time overhead to be simulated 
by a PTM . As the Church-Turing thesis (see Remark 4.1.1), also the strong 
Church-Turing thesis has survived all attempts to disprove it; however, as 
observed by Feynamn [116], this paradigm does not seem to be extendible to 
computational models based on quantum mechanics, for then classical physics 
appears unable to simulate their performances as efficiently. 


4.1.2 Kolmogorov Complexity 


In the following we shall restrict to the effective description of binary strings; 
using the notation of the previous section, we shall therefore consider TMs 
with the alphabet X = {0,1} U#. Further, (p) will denote the length, that 
is the number of bits, of a program p written as a binary string and U(p) the 
result of p being processed by a TM 4. 


Definition 4.1.3 (Kolmogorov Complexity). The Kolmogorov complex- 
ity [92, 310] or plain algorithmic complexity of i € 2) is the length of the 
shortest binary program p such that U(p) = ir) 5 


Cy(i™) := min{ (p) : U(p) = i i 


Plain algorithmic complexity is thus seemingly related to the most efficient 
way individual strings can be compressed; indeed, by the previous definition, 
no effective description of a given string i™ can be shorter than programs 
with length equal to its algorithmic complexity C(é”). 


Remark 4.1.3. Unlike computational complexity (see Remark 4.1.2), algo- 
rithmic complexity is not concerned with the space/time resources needed to 
process certain programs, but only with their lengths, without restrictions on 
time and memory. {From the algorithmic point of view, only random strings 
are interesting, while those with simple effective descriptions are somewhat 


‘The adjective realistic refers to the fact that the time and space resources 
effectively needed should be explicitly declared [165]. 

°We shall conform to the notation of [310] which uses the letter C for the 
Kolmogorov complexity and K for the prefix complexity (see Section 4.3). 
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dull, despite the large amount of resources that may be needed to compute 
them. Indeed, there might be short effective descriptions that require a very 
long time to yield their targets, as for instance the DNA-encoding of human 
beings [310]. The attempts to fill this gap by considering algorithmic and com- 
putational complexity together has led to the notion of logical depth [310]. 


Proposition 4.1.1. The following properties hold: 


1. The plain algorithmic complexities of a same string i with respect to 
two different UTMs W 2 differ by a constant which does not depend on 
the string, but only on the UTMs . 

2. The plain algorithmic complexity is upper bounded as follows 


Cy) < A+ eG™) =Atn, (4.5) 


where A is a constant which does not depend on i”), 
3. The number of strings i™ e Q™ with plain algorithmic complexity 
strictly smaller than c > 0 © is bounded by 


afi En” . Cy(i™) < c} <2]: (4.6) 


Proof: The proof of the first statement follows from the fact that U can 
simulate {5 and vice versa, for both are assumed to be universal. Given i™, 
let př be such that Cy, (é™) = (pt) and let Pig be the program, of length 
€(Pi2) = Liz, which allows Us to simulate U. In order to make Uy simulate 
{U on the input pj, the programs P2 and pj must be put together in way that 
U knows when the simulation instructions end and the string to be processed 
starts. This is achieved by concatenating Pı and př as q = p}3(Pi2), where 


i” = iriz in > B(G) = iyiqigig-++ininO1 


is the encoding of a string obtained by repeating each of its bits twice and 
marking the end with a the pair of different bits 01: for this encoding one 
needs £(G(Pi2)) = 2 (Li2 +1) bits. In this way Up will first read 3(Pi2) being 
thus able to simulate U on the subsequent portion pj of the program q. 
Therefore, from the definition of plain complexity, it follows that 


Cy, (i™) < L(a) + A < €(p*) +2 (L12 +1) + A < Cy, (i) + Arr . 


Reversing the roles of W 2 one gets Cy,(i™) < Cu, (i™) + A2; thus, 
[Cu (i™) — Cy, (#™)| < A, where A is a suitable constant which does not 
depend on the input i). 


®If c is not integer, c is to be understood as |c], the largest integer not larger 
than c: |e] <e< [c| +1. 
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The first upper bound follows as in Example 4.1.1, from the effective 
description which tells 4 to print the bits of i” one after the other. 

The second upper bound follows because the number of binary programs 
with length smaller than c equals the number of binary strings with |c| — 1 
digits at the most, whence 


[e]-1 
Hp: Up) <ct}= >> af sale -1 <21. 


j=l 


Example 4.1.5. In order to improve the loose upper bound (4.5), given 
i™ € R, let k be the number of 1s among its bits; there are (%}) strings in 
QA) sharing this feature. They can be listed and each of them identified by 
its number N, (i) in the list; notice that no more than [log (7)] bits are 
required to specify Nz (i). One can thus construct an effective description 
of i”, by specifying k and N;, (i) in such a way that the UTM must be 
able to detach the specification of k, pp, from that of N; (i). For this, one 
may do as in the proof of Proposition 4.1.1, by encoding pp as (pp), the 
binary string obtained from pp by repeating each of its bits twice and mark- 
ing the end by 01. Since, ¢(G(p,)) < 2 (log, k + 1), from Definition 4.1.3 it 
follows that 


CA) < logy o +2(logə k + 1). 


The following upper bound holds [92], 


n k 
< gn Hala) 
(ern. 


k k k k k 
where H2(—) := logy — log, — (1 logs) logs(1 — — logy), which can be 
n n n n n 


derived by setting p= k/n in 
nm J J n k n—k 
= a 1-—=+ > 1- <k<l1. 
i a ( 3 > (iat le 


1 e bekti 
Thus, O(a maya 2822+ 
m nr 


n 
prefixes, that is the initial n bits, of infinite binary sequences 4 € Nə. Let 
0< p< 1 be the probability of the bit 1; if k/n +> p, then 


cé™) 
n 


. Consider now the strings i to be 


< Ho(7) . (4.7) 


lim sup 
where H2(7) is the (log) entropy rate of a Bernoulli binary source with 
probability m = (p,1 — p). The upper bound in (4.5) is thus not a loose one 
for p close to 1/2. 
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Example 4.1.6. One would expect the algorithmic complexity of a pair (2, 7) 
of strings i,j E€ 92} to be smaller (apart from the usual additive constant 
independent of them) than the sum of the algorithmic complexities of ¢ and 
j, namely: 


C((i,j)) < Ci) + CG) +C. 


Intuitively, this should be so because one can always put together the shortest 
programs p, respectively q for i, respectively 7, in a program pq which is an 
effective description of (i, j). Unfortunately, the plain algorithmic complexity 
cannot enjoy the form of subadditivity expressed by the previous inequality. 

Indeed, if p,q are two programs such that C(i) = &(p) and C(j) = &(q), 
then any program using p and q to output the pair (i,j) must separate p 
from p, for instance by prefixing p with its length ¢(p) encoded by (6(¢(p)) 
(see the proof of Proposition 4.1.1) at the cost of 2(log (p) +1) extra bits. In 
this way, the reference UTM Y first computes p generating 2, then computes 
q, generating j and finally outputs (i, j). Thus, one estimates: 


C((4,9)) < L(B(E(p))p) + La) + Co 
< C(i) + C(j) + 2log, &(p) + C1 , 


where Co, are additive constants independent of the strings considered. 

The log, (p) extra bits cannot in general be avoided by reducing it to 
an additive constant independent of the input string. Indeed [120, 310], let 
(i) = n, (j) =m and set k := n + m; there are (k + 1)2* pairs (i, j) such 
that the concatenated string ij € RP., By setting c = (k + 1)2* in (4.6), one 
gets that at least one pair (i, j) of such strings satisfies 


C((i,j)) 2 k + loga(k + 1). 
Then, using (4.5), from k = n + m = L(i) + (J) it follows that 


C((4,9)) > CG) + CG) + loga(k +1) - C. 


Remarks 4.1.4. 


1. Since the algorithmic complexities of i™ with respect to two UTMs is 
a constant independent of the string, one can fix a UTM U once and for 
all and drop the reference to it in Cy(i™). 

2. The additive constant A in (4.5) can in line of principle be very large; 
however, since it is the same for all target strings i , it becomes less 
and less important with increasing n. The additive constant can even be 
got rid of if, as in Example 4.1.5, one considers infinite strings 2 € Qo, 


their prefixes iM e a and let n — oo in the algorithmic complexity 
cu) 
. 


per symbol 
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3. The bound (4.6) shows that the one in (4.5) is not too loose for large n. 
In fact, the fraction of strings i with complexity smaller than n — c, 
0<c< n, can be estimated by 


# {am ERM : C4) <n- c} 


ae 
an < 


Therefore, when n gets large, the number of strings with complexity sig- 
nificantly smaller than n gets small. 

4. In view of the previous remark, it is suggestive to define random those 
sequences i € Nə such their initial prefixes i™® fulfil C(é™) > n—c 
for all n € N, where c is a constant independent of n. Unfortunately, 
the vary same reason why the plain complexity is not subadditive (see 
Example 4.1.6 makes this definition not very useful [310]. Fortunately, as 
we shall see in Section 4.3, by using prefix TMs to compute programs 
one replaces the algorithmic complexity ci) with the so-called prefix 
complexity K(i™) and, in so doing, restores subadditivity and makes 
K(i™) > n—c forall n € N a good definition of random sequences 
i € M [310]. 


Non-Computability of C(i™) 


Algorithmic complexity is not computable; namely, there cannot exist an 
algorithm 7 able to compute the C(i™) for all strings. Indeed [254], if such a 
program q of length €(q) < co existed, then, one could construct the following 
program p: 


e Step 1: let io equal the empty string; 

e Step 2: generate the k-string 2, in the lexicographically ordered set of 
all binary strings, call for q and compute C(i;); 

e Step 3: if C(z,) > (p) write ip and halt else set k = k + 1 and 
go to Step 2. 


Since q, the program which computes the plain complexity of any input 
string, is assumed to exists, p also exists. Moreover, it has finite length 0(p) 
and halts with the first binary string, say ix» in lexicographical order, as out- 
put. Since the its plain complexity exceeds ¢(p), p is an effective description 
of ip» that is strictly shorter than its shortest possible effective description, 
which is a contradiction. 


Remark 4.1.5. [268] The non-computability of C(i‘™) implies the undecid- 
ability of the halting problem, namely that there cannot exist an algorithm 
able to decide whether a UTM {4 halts when processing a generic program 


TA TM according to the Church-Turing thesis. 
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p. Indeed, if such an algorithm existed, then one could compute ci”) for 
all i), Effectively, one would proceed by generating the binary strings in 
lexicographical order (each one of them is a program) and subsequently pro- 
cessing them in dovetailed fashion [310, 92]. That is, at stage 1, step 1 of 
program 1 is effected, at stage 2, step 2 of program 1 and step 1 of program 
2, at stage k, step k of program 1, step k — j + 1 of program j, 1 < j < k, 
and so on. At the N-th step, there will be three groups of programs, 


those that have halted with U(p) = i™; 
those that have halted with U(p) 4 i; 
those that are still being processed. 


Notice that in the third group there might be shorter programs than those 
which have already halted. Let p* be one of the shortest in the first group. 
One cannot set C(é'”)) = &(p*) because it cannot be excluded that a program 
p in the third group, shorter than p, will halt later with U(p) = i”. However, 
if the halting problem could be decided, then one would exactly have this vital 
piece of information and, waiting long enough, would have a means to find 
the shortest one among those programs such that U(p) = i”. 


In spite of the fact that the plain complexity is not computable, the 
previous remark provides a means to effectively approximate it from above; 
namely, one can construct a sequence of functions C; that can be computed 
by a UTM 4 on any binary input string i™ and get closer to C(i™) with 
increasing n [120]. Let U(p) denote the output of the computation by U 
of a program p that halts in t steps. By processing in dovetailed fashion 
the programs of length (p) < t, one can check whether during the first t 
computational steps some of them has halted with output i”), in which case 
one sets 


C,(i™) := min{&(p) < t: Up) =i™}, CO, (™) = +00 otherwise . 
Finally, with reference to the loose upperbound (4.5), let 
C, (i) := min{C,(i™) , n+ A}. 


The function C;,(i) can only decrease with increasing t; moreover, from 
Definition 4.1.3, C,(i™) > C(i™) so that it tends to the plain complex- 
ity of a”) monotonically from above. One says that the plain complexity is 
semi-computable from above. Notice that, although we know that the approx- 
imating values C; (i) tend to C(i) from above, yet we do not know how 
far from the actual value C(i™) any given C,(é) might be. 


Definition 4.1.4. A real function f on 23 is called semi-computable from 
above if there exists a non-increasing sequence of functions {frbren on 23 
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with rational values ë such that they are computable in the sense of Def- 
inition 4.1.2 and limps fx(t™) = f(é™). A real function f on Q% is 
called semi-computable from below if —f is semi-computable from above. A 
real function f on NZ is computable if it is semi-computable both from above 
and below. 


Remarks 4.1.6. 


1. The difference from semi-computable and computable functions can be 
understood as follows. If f is computable then there exist two mono- 
tone sequences of rational-valued computable functions {f; PY EN, ff 
non-increasing and f? non-decreasing, such that 


(m)) l a,b(4(n) 
FG) = lim fo). 
It follows that one can always estimate, for any 27 € $23, the distance 
between the computed values f; °(é) and the actual value f(i) by means 
of the computable difference ff(i)— ft(i). 

2. The approximations fp(i) of a function f(i) semi-computable from below 
can be seen as the result of a same program (binary string) pr. When a 
reference UTM {Y is presented with py, together with the binary repre- 
sentation i(k) of k and an input string i € 923, it computes fp(i), that 
is U( (pr, ilk), i)) = feli), where (pf, i(k), i) is the binary string which 
encodes and separates the various inputs. Consequently, as well as com- 
putable functions also semi-computable functions can be enumerated. 


An interesting class of lower semi-computable functions consists of the 
so-called constructive semi-measures [120, 310]. 


Definition 4.1.5. A positive function u : 23 — R is called a semi-measure 
if Viens w(t) < 1 and a constructive semi-measure if it is semi-computable 
from below. A constructive semi-measure m : N3 ++ R is called a universal 
semi-measure if for any constructive semi-measure u there exists a constant 
C, such that 

Cy, w(t) < m(i) Vi E€ R3. 


Working with semi-measures jz instead of measures allows for more free- 
dom; for instance constructive measures turn out to be automatically com- 
putable. Namely, if fk is a non-decreasing sequence of rational-valued com- 
putable functions that approximate u from below and J;e o; ua) =i 
one can construct a computable approximation 0 < uk < u such that, given 
e > 0, Diens mali”) > 1- e. Then, for all i € 923 it holds that 


8 Any p/q, p,q € N, can be written as a binary string (p,q) € 23. 
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\n(é) — weld) < 37 TORT TOETS 


Example 4.1.7. [120, 310] Constructive semi-measures can be enumerated 
(see Remark 4.1.6.2); let {un} denote their list and let {a(n)}nen be lower 
semi-computable positive numbers such that `, a(n) < 1. Then 


m := X a(n) ün > O(k) un V br - 


n 


m is thus a dominating semi-measure, it is also constructive and thus uni- 
versal in the sense of Definition 4.1.5; indeed, there exists a two-argument 
lower semi-computable function u(i™,n) that reproduces all constructive 
semi-measure by varying n € N. The idea of the proof is as follows. Given a 
lower semi-computable function f and a non-decreasing sequence of rational- 
valued approximations fp, let pp the binary program that allows a reference 
UTM {4 to compute them as outlined in Remark 4.1.6.2 and let {i;}j;en be 
the lexicographically ordered list of all binary strings. By computing them in 
dovetailed fashion, let then U} be the computable function defined by 


l U(r ilk), ij)) if lij) <k 
Up, a) = { oa 0 otherwise 


Notice that U% — f when k — +00; then, consider the recursive effective 
procedure consisting of the following steps: 


1. set up, (ij) = 0; 

2. set k= k+ 1; 

3. compute 21, %2,...,%% in dovetailed fashion; if some Ur. (ij) has not halted 
go to Step 5, else compute De UF. (i); 

4. if D; Up, (ij) < 1, set up, := U}, , and go to Step 2, else 

5. set uk, := ukr! and stop. 


By construction, the function u(i™ , pr) := limk +o uE i™) is lower semi- 
y H f + pf 


computable and a semi-measure; further, it coincides with f if the latter is 
itself a constructive semi-measure. 


Algorithmic Complexity and Thermodynamics 


Beside its many mathematical applications, algorithmic complexity has also 
been used to explore the relations between computation and thermodynam- 
ics [54, 51, 52, 268, 310]. As already remarked in this section, computing is a 
physical process and questions about its thermodynamic cost is surely of prac- 
tical importance, but also of general interest as they amount to asking which 
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computational steps are intrinsically irreversible and which ones can instead 
be performed reversibly [51]. As nicely illustrated in [268], trying to answer 
these questions brings together thermodynamics, computability theory and 
Godel incompleteness theorem. 

The starting step is the observation [187, 51, 116] that the only irreversible 
computer operations are intrinsically logically irreversible, namely those with 
outputs that do not uniquely identify the input. The most obvious instance 
of such operations is erasure and, as an oversimplified case, consider one 
molecule of gas contained in a cubic box of volume V in which a freely 
moving piston can be used to confine the molecule on the left side of the box, 
a case which is read as a bit 1. The flip operation which turns 1 into 0 can 
be effected reversibly by slowly rotating the box around its vertical axis and 
thus exchanging its right and left sides. 

In order to erase these two bits of information, the piston can be let 
loose so that free expansion (of one molecule) allows the molecule, which was 
confined in a volume V/2 before, to wander later within the whole volume V. 
If the process occurs isothermally at temperature T, the loss of information 
corresponding to the increase of the space at disposal corresponds to an 
increase in thermodynamical entropy and decrease of free energy (the internal 
energy does not change in isothermal processes): 


AS = k log2, AF = AU —-TAS=-—k«T log2. 


By extrapolating this simple observation, one is naturally led to the identifi- 
cation of free energy and free memory: one can consume free memory to store 
data instead of erasing them and in this wave saves free energy, or, vice versa, 
by consuming free energy in erasure processes one saves free memory [268]. 

Differently from erasure which can in no way be turned into a reversible 
operation, all other operations are only superficially irreversible and can be 
made reversible by adding enough supplementary information [51]. For in- 
stance binary addition maps the pairs (0,0) and (1,1) into 0 and pairs (0, 1) 
and (1,0) into 1. Therefore, by reading off 0 (1) one cannot decide which cou- 
ple of bits was the input; however, conserving the inputs and writing them 
together with their outputs turns the binary addition (®) into a reversible 
operation: 


080=0 (0,0) ++ (0,0,0) 
0i =T (0,1) + (0,1,1) 
1060= >’ (1,0) (1,0,1) 
161=0 (1,1) & (1, 1,0) 
eR" 

irreversible reversible 


Unfortunately, the redundant information that is used in order to make op- 
erations reversible has to be stored and this occupies free memory so that 
massive erasure operations are eventually needed, free energy consumed and 
heat waste generated. In order to minimize free energy consumption, one can 
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first proceed to reversibly compress as much as possible the stored informa- 
tion to be erased. For instance, in the case of the binary addition, one can 
use only the first input bit since the second one can be recovered by binary 
subtraction (©) from the output bit: 


(0,0) => (0,0) , 060=0 
(0,1) (0,1) , 160=1 
(1,0)4(1,1) , 161=0° 
(1,1)4(1,0) , 061=1 


still reversible 


Suppose the occupied memory consists of a binary string i”), then the best 
compression achievable is given by the shortest binary program p* such that 
U(p*) = i™ whose length is the Kolmogorov complexity cia”). Reversibly 
encoding i™ into p* and erasing the latter entails the optimal loss of free 
energy op F = —«T C((é™) log 2 to be compared with AF = -ns T log 2. 

These considerations suggest [326] that, when dealing with the thermo- 
dynamics of computation, the notion of entropy should be improved by the 
addition to the standard thermal contribution, Stn, of the one coming from 
the optimal erasure of the memory 


Scomp = Sth, + kC(M) log 2 ; 


where C(M) is the algorithmic complexity of the computer memory. For in- 
stance, by using Scomp, the Maxwell’s demon paradox [190] can be solved by 
observing [326] that Stherm can indeed be diminished by the demon collect- 
ing together all fastest particles and transferring heath from lower to higher 
temperatures. However, storing all the information necessary to comparing 
particle velocities rapidly consumes free memory and asks for erasure thus 
restoring the second law of thermodynamics. 

Unfortunately, the main problem with optimal compression is that it is 
based on the knowledge of the algorithmic complexity of the occupied memory 
which cannot always be computed. In few words, performing an optimal 
compression of the memory content before erasure is not always possible 
and there will always be an excess of free energy consumption. As this is 
ultimately due to the undecidability of the halting problem, this effect can 
be suggestively and not unduly called Gödel friction [268]. 


4.2 Algorithmic Complexity and Entropy Rate 


Despite Remark 4.1.4.4, there is a sense in which the Kolmogorov complexity 
can be used to look at the individual trajectories of a classical dynamical 
system (¥,T, u) and at their randomness, namely through their asymptotic 
complexity rate. As explained in Section 2.2, a partition P of Y provides a 
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symbolic model (Qp, To, up) whereby trajectories are reduced to sequences 
iE Q C Np of symbols from an alphabet with p letters of which one can 
study the complexity of the prefixes i™ € QD) 3, 

As for the Shannon entropy, when dealing with sequences, one may decide 
to focus not on the Kolmogorov complexity which generically diverges, rather 
upon its rate or complexity per symbol [7, 69]. 


Definition 4.2.1. The complexity rate of a sequence i € Qy is given by 


1 
c(i) := lim sup c(i) ; 


n—co 


(n 


where i” is the initial prefix of i of length n. 

Given a dynamical system (¥,T, u) and a finite, measurable partition 
P of X, let i(x) E€ Np denote the symbolic trajectory that P associates to 
the trajectory {T"x}n>0 issuing from x € X. Then, the complexity rate of 
{I"rhn>o0 with respect to P is c(x,P) := c(i(x)). 


To start with, we shall consider the case of a dynamical system which 
is itself already a symbolic model, namely a binary information source. An 
important result is that, typically, for sequences emitted by ergodic sources, 
the bound (4.7) becomes an equality in the limit. 


Theorem 4.2.1 (Brudno’s Theorem). Let (22,T,,7) be a binary ergodic 
source with entropy rate h(t). Then, 


c(i) = lim Loa) =h(r) , (4.8) 


for almost allt E€ N2 with respect to 7. 


The proof [69, 318, 166] consists 1) in using the counting argument (4.6) 
and the AEP (Proposition 3.2.2) to show that 


lim inf PCG > h(n) T — ae ; (4.9) 


n=œ n 


and 2) in providing, for the initial prefixes i™ of m-almost all i € Q3, an 
L(pi(n 
appropriate binary program p;(n) € 923 such that lim Kes) < h(t) and 
noo n 
U(pi(n)) = i whence 
°In order to do this, one has to extend Definition 4.1.3 to the case of strings of 


symbols from generic finite alphabets. This is straightforward and will always be 
understood in the following. 


124 4 Algorithmic Complexity 


1 
limsup —C(i™) < h(r) mae. (4.10) 
n 


Proof of the lower bound : Because of the assumption of ergodicity, The- 
orem 3.2.1 allows us to use the AEP with the entropy rate h(7) in place of 
the Shannon entropy H(A). Let AS”) C RP” be the set in (3.27) consisting 
of binary strings ¢ such that 


DAHA < phi) < reao 
and A‘”) C M the set of sequences whose initial prefixes of length n, i” 


belong to AS”) and have complexity C(i™®) < n(h(m) — 2e). From (4.6), it 
follows that 


n( â) — n({a E A™ : CC) < nla) -29}) 
< #(4™) - max q(i®™) 


ime A™ 
< gn(h(r) 2e)+1 | 2 n(h(m)—e) — go-netl 


ys < n(h() — 2€), it 


Since strings i) ¢ AS” may also have complexity C(i"” 
is necessary to control their overall probability. Set (A ers RA AM and 


AW) = {ie (AM): CG) < k(A(m) — 26}, BM = U Aw , 


Since A® c (ÂP): implies r(B™) < z( U (A) i =1- “(I N Ål , 


>n 
it follows that the probability of the set of sequences whose initial ected 


have complexity C(i™) > n(h(m) — 2e) is estimated from above by 


Ak) y Al’) A(k) (n) 
(Uae vA}) < (U49) +2(#”) 
—netl 


< So akett 4 7( BW”) < = + 1-7((f) Ag) 


k>n k>n 


The set Neva A® consists of sequences 2 € Nə whose initial prefixes are 

typical for all lengths k > n; therefore lim n( N AW) = 1. It thus follows 
k>n 

(n) 

ieee 


n>n n 
lower bound follows. 


> h(a) — 2e z-almost everywhere. Since € is arbitrary the 
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V5 4) = ijiz sin, fixO< Len 
and consider all strings of length L made of consecutive bits of i: there are 
n—L+1 of them: 


Proof of the upper bound : Given oF 


Sk i= Upto tp+k-1 ; 1l<k<n-D41. (x) 


Let ie denote their set and let N(s) be the number of occurrences of the 
string s € D N (s) can be expressed as follows. Let i € R be any sequence 
with initial prefix of length n equal to i, then 


n—-L+1 


N(s)= Do xT), œ) 


j=0 
where T, is the left shift and y,(T/(z)) is 1 if the initial prefix of length L in 
Ti(z) equals s, 0 otherwise. 
Given the N(s), s € RE, one can thus construct a so-called empirical 


probability distribution E on os 
N(s) 
To = {POS}, Ps) = —— T 
with corresponding Shannon entropy 
L 
H (myn) = — J p(s) log, p(s) . 


(L) 
sEN in) 


Notice that the set of lengths ¢(s) := [—log, p)(s)] is such that 
— logs pi” (s) < &(s) < — loga p(s) +1; (*#) 


therefore, they satisfy the Kraft inequality 


5 gee) < 5 pP (s) =. 


(L) (L) 
SEN tn) sEQ in) 


Because of Proposition 3.2.1, there thus exists a binary prefix code over the 
strings s € a. consisting of codewords w(s) of lengths (s) := €(w(s)). 

With sẹ as defined in (*) above, for a given 1 < j < L—1, consider the 
adjacent strings of length L of the form s;+),1,0 < pj < pj*“". Since the first 
bit of s; is 7; and the last bit of Siteperl is 154(pmee41)L— 1; then the bit not 
belonging to any Sj+p;L are iiiz: -ij and 154+ (pmar 41) D4; +(p mar 41)D41°** ns 
whence 


n-j-L+1 


jt (OP + Y)L-1< n= ppm < T ; 
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for a total of no more than 2(L — 1) bits. Also, since any 1 < k < n can be 
written ask = j + pL with 1 < L — 1 and 0 < | ie determined, 
then, for different 1 < j < L—1, the sets S; := ETE ls 


and US) SS). 


im) 


pi=0 » do not overlap 


Consider a program Q; that reconstructs i” by specifying the codewords 
W(Sj+p;L) plus the bits uncovered by them; its length can be bounded from 
above as follows: 


£(Q;) $C +2(L ay oe 


pj=0 


where C is a constant independent of j and of L. Further, (* * *) entails the 
following bound for the plain algorithmic complexity of i”; 


oti , 1 
CO) < min EQ) < Faq DAs) 
<is = 
1 isir; 
<C+2(L-1) 7 >, Hexen) 
j=1 p=0 
1 
SOPALI 2 N(s)&(s) 
se, 
n-L+1 
<C+%{L-1) += = D p(s) (— loga p(s) +1) 
sen 


L 
H(n yr 


From ergodicity and (**), it follows that, when n — ov, 


Nle) p(s) = x(a’) 


n-L+1 


L-1] 


for 7-almost all sequences 2 € 22, where cl: is the cylinder set containing 


all i € 22 with s € oF as initial prefix. Thus, when n — oo ae tends to 
the probability distribution 7” over the partition C™) = fora of Rə 


(L) 
2 
indexed by the strings s € a), then, by continuity, 
1 CH) 41 
lim sup —C(é™) < ( = , wae 


By taking L — oo, the upper bound follows (see Remark 3.1.1.1). 
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The previous result that holds for ergodic binary information sources can 
easily be extended to generic ergodic sources and then to ergodic dynamical 
systems via Definition 4.2.1. 


Proposition 4.2.1. Let (X¥,T, u) be an ergodic dynamical system and P a 
finite, measurable partition of X; then 


oa P) = hS (T, P) u— a.e. 


Proof: The partition P defines a symbolic model (Bp, Ts, up) which is an 
ergodic shift-dynamical system. The result follows since Brudno’s theorem 
ensures that for up-almost all i € 2p, hence for -almost all x € X, it holds 
that c(i) = h(wp) = hiss (T, P). 


Corollary 4.2.1. Let (¥,T, u) be an ergodic dynamical system and P a fi- 
nite, measurable generating partition of X; then 


c(x, P) = n (T) u—a.e. 
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A way to eliminate the logarithmic correction that spoils the subadditivity of 
the plain algorithmic complexity (see Example 4.1.6) is to ask that the only 
acceptable programs for the UTM {4 are the so-called self-delimiting ones, 
namely those containing the specification of their lengths, so that the UTM 
always knows when its input programs end. These programs have the prefix 
property that if U halts on one of them, say p, then p cannot be the prefix of 
any other halting program for U. Any TM that accepts only programs with 
the prefix property is called a prefix TM ; it can be showed [78] that there 
exist prefix UTMs capable of simulating the behavior of any other prefix TM. 
The consequences of the prefix constraint are far reaching. One first proceeds 
to define an adapted version of algorithmic complexity of binary strings (the 
extension to strings from different alphabets is straightforward). 


Definition 4.3.1 (Prefix Algorithmic Complexity). The prefix algorith- 
mic complexity of i”) € 2”) is the length of the shortest program p such that 
Sp) = i™ , where 4 is any chosen reference prefix UTM : 


KG) = min S 0p) : Up) = i , 4 a prefix UTM } . 
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Remarks 4.3.1. 


1. A prefix TM can be figured out [78] as a TM with a control unit, two tapes 
and two reading-write heads. The first tape, the program tape, is entirely 
occupied by the program which is written as a binary string between two 
blank symbols marking its beginning and its end; the program is read by 
a head that can only read, halt and move right. The second tape, the work 
tape, is, as in the case of an ordinary TM , two-way infinite and the head 
on it can read, write 0,1, leave a blank #, halt or move both right and 
left. The computation starts with the head on the program tape scanning 
the first blank symbol, the other head on the 0-th cell of the work tape, 
only finitely many of its cells possibly carrying non-blank symbols, and 
with the control unit in its initial ready state qr. Then, in agreement 
with the symbols read by the two heads and the control unit internal 
state, the head on the working tape erases and writes or does nothing 
and then moves left, right or stays, the head on the program tape either 
moves right or stays, while the control unit updates its internal state. The 
computation terminates if the reading head on the program tape reaches 
the end of the program, in which case, the output is what is written on 
the work tape to the right of the cell being scanned by the head until 
only cells with blank symbols are found. The program halts if and only 
if the head on the program tape reaches the end of the tape. 

2. Since the set of programs with the prefix property is smaller than the set 
of all programs, then 

ca™) <KG™). 


On the other hand, if p is such that C(i™) = ¢(p), then, considering its 
self-delimiting encoding p* := G(€(p))p, it follows that 
K(i™) < E(p*) < C(é™) + 2log Lp) +C. 


3. The prefix complexity is subadditive; in fact, if p and q are programs such 
that K(i) = (p) and K(j) = (q), with 2,7 € 2*, then, since p and q are 
now, by definition, self-delimiting, one has 


K(i, j) < K(i) + KG) +C. 


4. Unlike for the plain complexity (see Remark 4.1.4.4), one can rightly 
define random those sequences 2 € N2 for which 


KGi™)>n-c, 


for all their prefixes i‘”), that is all those sequences whose prefixes i™ 
have prefix complexity that increases at least as n. Indeed [310], it turns 
out that these sequences are those and only those passing all constructive 
statistical Martin-Lof tests checking whether they belong to effectively 
null sets (see footnote 1). In this sense, relative to the prefix definition 
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of algorithmic complexity, Levin’s chaoticness and typicalness mentioned 
in the introduction to this section are equivalent characterization of ran- 
domness. 


One of the most important consequences of working with prefix UTMs L is 
that their halting programs p form a set of prefix codes for the output strings 
U(p) = i € 23 and their lengths satisfy the extended Kraft inequality (3.2.2). 


Example 4.3.1. Consider a prefix UTM 4 and the so-called Chaitin magic 
number [80, 92, 50] defined by 2 = 5 274P), where the sum runs over 
p:U(p)| 
all halting programs p; because of the prefix property, 2 < 1. 
Let us consider the binary expansion of 2 which has infinitely many Os if 
it is rational and suppose an algorithm exists that calculates the digits of 2. 


Then, the n-digit approximation 2, := 5 z is such that 2, > Q — 27”. 
j=1 
Then, one knows whether 4 halts on programs of length < n. 
Indeed, by listing them in lexicographical order and by processing them 
in dovetailed fashion, one can collect all programs pj, p2,... that halt until, 
after T(n) computational steps, 


m(n) 
Dna 5 oS Qn. 


If p is any program halting in more than T(n) computational steps, one gets 
N > Sn +OP > Rp, +P > Oe 28, 


Therefore, (p) > n so that if a program of length shorter than n has not 
halted in T(n) computational steps it will never halt. 

Let G(n) be the set of strings i; := U(p;), j = 1,2, ..., m(n), correspond- 
ing to the outputs of the programs that have halted in T(n) computational 
steps and let 4 denote the first string (in a suitable order) not in G(n). Such 
string must have prefix complexity K(i) > n: indeed, if K(i) < n, there 
would exist a program p of length < n such that U(p) = i. However, from 
the previous discussion one deduces that also p must have halted in T(n) 
computational steps so that i € G(n), too. Further, let p* be any shortest 
effective description of the string 2) := wwz- -wn consisting of the first 
n bits of Q, namely K(Q™) = (p*). Then, by means of a fixed number c of 
extra bits, one can use the knowledge of the Q™ to recover i, whence 


n < K(i) < Up) + c=K(A™) +e = K(R™) > n-c Vn. 


Then 2 is a random sequence in the sense of Remark 4.3.1.4. 
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Definition 4.3.2. Given a prefix UTM 4, the map NZ 3 i> Py(t), where 


Pais: XO 20 (4.11) 


p:U(p)=i 


defines a so-called the universal probability on 93. 


This definition makes sense, for, as a consequence of the prefix property, 
not only (4.1) holds, but it also turns out that 


Seo > XY 270 <1. 


iE. GDF p:U(p)=i 


Remarks 4.3.2. 


1. If a prefix TM 2 halts on p = 0 and q = 1 with the strings 2 and j as 
outputs, then Pali) = Palj) = 1/2 since no other program can halt. 
Without the prefix restriction the sum in (4.11) would diverge simply 
because all programs prefixed by p and q would also output 2 and j. 

2. After division by jeg» Pauli), Pu(i) represents the probability that i 
be the output of U running a binary program p of length ¢(p) randomly 
chosen according to the Bernoulli uniform probability distribution that 
assigns probability 2~") to anyone of them. Since short programs have 
higher probabilities, random strings have smaller algorithmic probabili- 
ties than regular ones. 

3. The probability Py is called universal (see Example 4.1.7) for the fol- 
lowing reason. Let X be any prefix TM and q a program such that 
A(q) = i € NJ; further, let q’ be a self-delimiting program of fixed length 
L that makes 4 simulate X so that U(q'q) = A(q) = i. Then, 


Py) = SP 2M > J) wea) =E Pali). (4.12) 
p:U(p)=i q: U(q'q)=i 


Suppose now 7 = {p(i)}icn; to be a computable probability distribution 

over §23 (see Definition 4.1.2). Consider a prefix TM 2 that does the 

following: 

— it computes the probability distribution 7; 

— it encodes the strings 2 € 3 by means of the Shannon-Fano-Elias 
code corresponding to the computed 7 (see Example 3.2.3); 

— given a program q € 25, it checks whether q is the Shannon-Fano- 
Elias code for any 2 € 925; if so, it outputs t. 

Since the lengths of the code-words are as in (3.23), then, for all i € 3, 


Py) = XO 2“) >4p(i). 
A(q)=t 
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For the prefix UTM 4U to work as 2, it is necessary to compute the 
probability distribution 7, whence the program q’ in (4.12) is such that 
L = K(7) + L’, where K(z) is the prefix complexity of m, where it is 
understood that the computable probability distribution m is written as 
a binary string (denoted by the same symbol). Then, for all computable 
probability distributions m on 925, 


Pali) > C2-K™ pia) , (4.13) 


with C > 0 a constant independent of i and 7. 


Universal probability, prefix complexity and Shannon entropy of com- 
putable probability distributions are intimately related. Given a prefix UTM 
U, the programs p* such that U(p*) = i € Q3 with ¢(p*) = K(i) provide a 
prefix code such that 


Pali)= XO 2) ao, (4.14) 
U(p)=i 


Further, if the strings 2 are chosen at random with respect to a computable 
probability distribution 7, then (3.22) implies that the corresponding average 
length, namely the average prefix complexity, satisfies 


do pli) K(é) > Ha(w) = — SF pli) log, pli) - (4.15) 


ie OX iEn 


There might be infinitely many programs such that U(p) = i, yet the lower 
bound in (4.14) is surprisingly good as the sum is actually dominated by the 
shortest programs for 2. 


Proposition 4.3.1. For all i € R3, Pali) < C2-*™, where C > 0 is a 
constant independent of i. 


Together with (4.14), this result permits the identification (up to an ad- 
ditive constant) of the prefix complexity of a string with minus the logarithm 
of its universal probability. 


Corollary 4.3.1. K(i) = — log, Py(t) + O(1). 


There thus appears a similarity between the fact that the optimal code- 
word lengths with respect to a probability distribution 7 = {p(t)}ie, are of 
the form ¢* = — log, p(i) and the fact that the lengths of the shortest descrip- 
tions of binary strings practically amount to the logarithm of their universal 
probabilities. This similarity can be carried even further by examining the 
average complexity. 
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Corollary 4.3.2. Given a computable probability distribution m on (23, the 
corresponding average prefix complexity satisfies 


Hp(m) < X. pi) K(i) < Holm) + K(m) +. 
i€QS 
Proof: From Proposition 4.3.1 and (4.13) 
K(i) < — logs Puli) + C < — logs pli) + K(m) +C. 


Multiplying by p(i) and summing over 7 € 23 yields the upper bound, 
while (4.15) gives the lower bound. 


Proof of Proposition 4.3.1 : The idea is to construct, for each 2 € 3, a 
set of programs p of length ¢(p) < — log, Py (i) + C” with the prefix property 
such that U(p) = i, so that K(i) < (p) would end the proof. Unfortunately, 
the argument of Remark 4.3.2.3 is not viable as the universal probability is 
not computable. However, as much as for the plain algorithmic complexity, 
the prefix complexity is semi-computable from above whence the universal 
probability results lower semi-computable because of Corollary 4.3.1; this 
turns out to be sufficient for constructing a prefix code with the desired 
property. Let all the programs (listed in lexicographical order) be run by UL 
in dovetail fashion and collect them in pairs (pp, £k) where pp is the program 
which halts at the k step of the dovetailed computation with a, E 25 as 
output. The quantity 


Palk, =£):= XO 20 < Pyu(æ) 


(pj. 2, =e) 
i<k 


is computable and tends to Py(a) along the subsequence {(pz, £p = £)}x; 
set np := |— logs Pu(k, x, = x)]. Since 


9—4 (k) < Py(k, £p = x) < g—ba(k)+1 f 


where Z, (k) is the smallest length in the sum, it follows that ng = 4, (k). Given 
(Pk, £k, Nk), this triplet is assigned to the first non-occupied node at the (ng + 
1)-th level of a binary tree; further, in order to enforce the prefix condition, all 
nodes stemming from it are made unavailable to further assignments. Since 
nz is not strictly monotonic, it may happen that different pairs (p;i, £i = £x), 
i < k, have the same nz; by eliminating all but the first pair with that value 
of nk, no more than one node will be occupied by a triplet with the same x; 
at level ng. Therefore, 


ng > — log, Pulk, £p = £) > — log, Py (x) => np = |— log Pulæ)| + rk 


with rẹ > 0 and ry Æ rj for j A k. To each x € 3 there correspond many 
assignments of triplets (Pk, £k = £, Nk) each one of them to one and only 
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one node at level ng +1. The nodes thus provide binary code-words of length 
nz +1 for the triplets. In order to see that there are sufficiently many nodes 
to accommodate all triplets, we check that the lengths n;+1 satisfy the Kraft 
inequality (3.21). That this is indeed so follows from the fact that 


y g-nk — 97|- loga Pu(æ)] iS 277k < 2 Pu(æ), 


TTk5T£ LE =e 


for all x € 223, whence 


Yo yee Ss Pe 


TENZ L_p=x we M3 


The above algorithm allows the construction of a binary tree whereby any 
x € N3 can be identified with the binary string i(æ) € X corresponding to 
the lowest depth node assigned to its triplets (pk, £k = £, nx). The length of 
the code-word i(a) € 23 is the smallest ng + 1: 


((i(æ)) < [logs Pa(æ)] + 1 < — logs Pu(w) +2. 


Finally, let q be a program of fixed length L that makes the prefix UTM U 
generate the binary tree by dovetailed computation as specified above and 
let q’ be another program of fixed length L’ with the necessary instructions 
to U such that, when presented with the code-word q'qi(æ), U computes the 
program in the triplet assigned to the node marked by i(æ), writes the result 
and halts. By construction, the program p in the triplet at the node i(a) is 
such that U(p) = x; then 


K(x) < €(q'qi(x)) = — logy Pule) +04 L'+2. 


Remark 4.3.3. Because of its construction the universal probability is a 
lower semi-computable semi-measure (see Definition 4.1.5), thus there exists 
a constant Cp such that Cp Py < m, where m is the universal semi-measure 
constructed in Example 4.3.2. Furthermore, an argument similar to the one 
in the previous proof, extends the result in Corollary 4.3.1 to 


K(i) = — logy Puli) + O(1) = — loga m(i) + O(1). 
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Possible uses of algorithmic complexity theory in relation to the predictabil- 
ity of discrete classical dynamical systems are discussed in [62]. Approaches 
to complexity issues in a broad sense can be found in [18, 294] 
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Quantum Dynamical Systems 
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In the second part of the book quantum dynamical systems with finite 
and infinite degrees of freedom are presented by using the algebraic approach 
to quantum statistical mechanics. The corresponding technical framework 
proves convenient for the extension of ergodic and information theory to 
non-commutative contexts. 


5 Quantum Mechanics of Finite Degrees of 
Freedom 


Quantum dynamical systems are described by means of non-commutative 
algebras of observables, by means of their time-evolution and by means of 
the expectation functionals that assign mean values to them. Classical dy- 
namical systems can always be described in terms of phase-points and phase- 
trajectories; however, an algebraic formulation is always possible and has two 
advantages: on one hand, similarities and differences with respect to quan- 
tum dynamical systems become more evident and, on the other hand, one 
can infer from the algebraic reformulation of classical notions how to possibly 
extend them to the quantum setting. 

With reference to information, the most important difference that one 
encounters passing from the commutative to the non-commutative setting is 
that the disturbances exerted on quantum systems by measurement processes 
cannot in general be made negligible, not even in line of principle. 


5.1 Hilbert Space and Operator Algebras 


In standard quantum mechanics, physical states are usually described by 
normalized vectors in separable Hilbert spaces, and the observables by self- 
adjoint linear operators acting on them. Here follows some notations and 
basic facts. 


1. |v), |), or w, ¢, and |i), with ¿i running on a suitable index set J, will 
denote (normalized) vectors in Hilbert spaces H and Py = |w)(w| the 
associated orthogonal projectors. 

2. The scalar product on H, denoted by (w|@), linear in the second ar- 
gument and anti-linear in the first one, satisfies the Cauchy-Schwartz 


inequality 

Kylo) < lellig- (5.1) 
Any finite or countable set {W;};icr C H such that (W; |W;) = ði; and 
Ib) = Vier(4|V)|%) for all y € H, is an orthonormal basis (ONB) 


in H. The corresponding projectors P; := | W; )( W; | fulfil 
R=) |ANA] =1, (5.2) 
icI icI 
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where 1 denotes the identity operator on H, 1|w) =|w) for all y € H. 
3. Given any linear operator X on H, its matrix elements with respect to 
p, ġo € H will be denoted either as (y| X¢) or as (w |X |}, depending 
on notational convenience. 
4. XT and X* will denote transposition and complex conjugation with re- 
spect to a given ONB {W}; };: 


(n| XTE) = (D| XT), (D| X0) = (P| Xy. 


Instead, Xt = (X7)* = (X*)? will represent the basis-independent ad- 
joint of X: 


(YIX) = (Xlo) = (PIXY) Vd, de H. 


Physical observables correspond to self-adjoint operators X = Xt. 
5. The uniform norm, ||X||, of a linear operator X on H is defined by 


IX] := sup |X| 4). (5.3) 
Ip||=1 


6. X is bounded if || X || < co, in which case 


XI) SAIN, Mel XY)| SAMMY - (5.4) 


Linear combinations of bounded operators are again bounded; their lin- 
ear span will be denoted by B(H). The product of bounded operators is 
bounded, for || XY] w)|| < ||X]| ||¥]| ||w||. Therefore, BCH) is a so-called 
*-algebra. 

7. An operator U € B(H) such that UU = 1 is called an isometry; in 
general, U Ut = (UUt)(U U1) is a projection, if also U Ut = 1, then U is 
a unitary operator. Isometries have ||U|| = \/||U? U|| = |/1|| = 1. 

8. The uniform norm defines on B(H) uniform neighborhoods of the form 


U(X) = {Y € BCH); |X-Y|| <e}, e20, (5.5) 


whence a sequence X, E€ B(H) converges uniformly to X € B(H), 
limno Xn = X, if limno |X — Xnl|| = 0. The corresponding topology 
on B(H), Tu, is called uniform topology. 

9. B(H) is complete with respect to the uniform topology, namely all se- 
quences of operators which are of Cauchy type with respect to the uni- 
form norm converge to an element of B(H). Therefore, B(H) is a so-called 
Banach *-algebra. Moreover, since the uniform norm fulfils 


IXH =X” XT XI = IX]? , (5.6) 


B(H) is a C*-algebra (see Section 5.2). 


10. 


11. 


12: 


13. 


14. 
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In the case of n < co degrees of freedom, each one of them is described 
by a Hilbert space H;, 1 < j < n. Altogether, their Hilbert space is 
the tensor product H™) = Q; H,, denoted by H8” when the Hilbert 
spaces H; are copies of a same H. Depending on notational convenience, 
its vectors will be denoted either by |) = | Y1) @|%2) @--++| vn) or by 
|W) =| @ de ®-++ Un), with scalar products (|Y) = [Tt (8; |v): 
Bounded operators on H are linear combinations of tensor products of 
the form X; ® X2 8 --- Xn, X; € B(H,); the associated C* algebra of 
bounded operators on Hi”) is B(HI™) := Q; B(H;). 

The strong topology on B(H), Ts, is the smallest topology with respect 
to which all semi-norms of the form £L,(X) := ||X|w)||, Y € H, are 
continuous; its strong neighborhoods are of the form 


Us(X) := {Y € B(H) : Ly,(¥ -X) <e, 1<j <n}, (5.7) 


for Y; € H, n € N and e > 0. A sequence Xn € B(H) converges strongly 
to X € B(H), s — limp Xn = X, if limp. || (Xn — X)|w)|| = 0 for 
all y € H. 
The weak topology on B(H), Tw, is the smallest topology with respect to 
which all semi-norms of the form Lg (X) := |(¢|Xwv)|, ¢, Y € H are 
continuous; its weak-neighborhoods are of the form 


U? (X) := {Y € B(H) : Løy Y- X) <E, legen}, (5.8) 


for Yj, j € H, n € N and e > 0. A sequence Xn € B(H) converges weakly 
to X € B(H), w — limn>œ Xn = X, if limp. |(¢ | (Xn — Xy )| = 0 for 
all ¢,~ € H. 

Since strong neighborhoods are uniform neighborhoods, but the reverse 
is not true when H is infinite dimensional, the uniform topology is in 
general finer than the strong one, that is 7, has more neighborhoods 
than Ts: Ts < Tu. The weak topology is in general coarser than the strong 
one; every weak neighborhood is also a strong neighborhood, but the 
reverse fails to be true in infinite dimensional H. The norm, strong and 
weak topologies are equivalent in finite dimension. 

Among other topologies on B(H) [64], one of some use in the following is 
the o-weak topology, Tuw; it is finer than the weak topology for it is the 
smallest one that makes continuous the following semi-norms, 


LEG ny (X) = X Idn | X ln DI 5 (5.9) 


where {tn}, {n} C Hare such that >, ||Pnl|? < œ and £, lonl]? < o. 


Most of the previous assertions are standard facts [64, 251, 300]; however, 


the various topologies on B(H) deserve a closer look. 
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Remarks 5.1.1. 


1. That the uniform topology is finer than the strong topology can be seen as 
follows. Given any strong neighborhood U3(X), let a := maxı<i<n ||Yill, 
then Utal X) C UE(X); indeed, 


Y EUa = X -PS Éil < e = Y Eux) , 


whence U(X) is a uniform neighborhood, too. In order to show that Tu 
is in general strictly finer than Ts, it is sufficient to exhibit a sequence of 
operators in B(H) which converges strongly, but not uniformly. To this 
end, suppose H to be infinite dimensional, choose a ONB {Wk }ken with 
associated orthonormal projectors P, and construct Qy := y Pe. 
Then, (5.2) reads s—limy Qn = 1; namely, if Y € Hand cy (i) = (Yi |v), 
then 


oO 


jim Qu- 1/4)? = im ST em?s. 
n>N-+1 


On the other hand, Qxy — 1 cannot hold in the uniform sense, otherwise 
for any € > 0 there would exist No(e) such that, if N > No(e), then 
(Qu — |b) || < £ uniformly in ý € H, while ||(Qw —1)|¥)|| = f] for 
all w in the subspace orthogonal to that projected out by Qn. 

2. In like manner, the weak topology cannot have more neighborhoods than 
the strong topology. Given UY (X) as in (5.8), set 6 := maxi<i<n llill; 
then U2/4(X) CUP(X); indeed, using (5.1), 


Y E€ Ua (X) = K(X- Y) Wi) < SIldill Se = Y Ee U” (x) . 


In general, Ts is strictly finer than Tw. Let H be infinite dimensional and, 
given an ONB {W }ken, consider the operator X : H > H defined as the 
right shift along the ONB : 


= + = 0 k=1 
Xha) =d a.y too 
Note that XIX| Wk) = |%) for all k € N so that XIX = 1; X is an 
isometry with XXt projecting onto the subspace orthogonal to Y. Fur- 
thermore, by expanding H > |W) = XZ cw(k)| We), ewlk) = (Yl), 
it turns out that 


IXP)? = Cb] (XX y) = [IP , 


whereas 
w-— lim X” =0. 


n—CoO 
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Indeed, given ¢,7 € H, € > 0 and |wx) = EÉ ylit) uthat 
Ily) -lyg )I| < e, (5-1) yields 


kelx” ko) < Klx” Weed] sie onati + ell 
w=1 


kK K+n 
<4] > les? 4) D2 lew@l? + elel, 
i=l i=n+1 


where the second square-root becomes negligibly small for sufficiently 
large n. 

3. By adding to a subalgebra A C B(H) its limit points with respect to a 
given topology T, one obtains its closure A’. If of two topologies 71,2 on 
A C B(H), Tı is coarser than T2 (71 < 72), T} has less neighborhoods 
than 7) and thus more convergent sequences; therefore, A” D A”. In 
particular, A ™ is a C*-subalgebra of B(H); further, since Tu = Ts = Tw 
it follows that A™ CA™ CA™. 

4. Given a *-subalgebra A C B(H), consider the linear functionals F : 
A™ = C, respectively F : A™ + C, that are continuous with re- 
spect to two topologies Tı < T2; more precisely, the preimages F~!(V) of 
open sets V C C are open sets in A 7, respectively A 72. Then, since not 
all open sets in A ™ are open sets in A 7, a 79-continuous F’, that is con- 
tinuous with respect to the finer topology, may fail to be 7-continuous, 
that is continuous with respect to the coarser topology. For instance, all 
weakly continuous linear functionals on A C B(H) are strongly continu- 
ous but strong continuity does not in general ensure that a functional is 
also weakly continuous. 

5. If A is a generic Banach algebra, its topological dual, A* is the linear space 
A* consisting of all linear functionals F : A > C that are continuous on 
A. Then, A* can be equipped with the so-called w* -topology, namely with 
the coarsest topology that makes continuous all semi-norms of the form 


LY (F)=|F(X)|  VXEA. (5.10) 


Its neighborhoods are of the form 
UL" (F):={GEA* : LY (G-F)<e,1<j<n}, (5.11) 


for any X; E€ A, n E€ N and e > 0. A sequence F, E A* w*-converges to 
F € A*, w* — liMmn>o Fn = F, if limps |Fn(X) — F(X)| = 0 for all 
XEA. 


5.2 C* Algebras 


The bounded operators on a Hilbert space H form a Banach -algebra with 
respect to the uniform norm (5.3); this norm fulfils the two equalities (5.6). 
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While the first one follows at once from (5.3) and the definition of adjoint, 
the second one is proved by using (5.1): 


|X|? = ea < XXI = XI < XM; 


thus, exchanging X and Xt, yields || X|? = || Xt]? = || XŤ X||. From the fact 
that ||XY|| < || XI| |Y || (an inequality that follows at once from (5.3)), one 
gets || XT| = ||X]||; in fact, 


IXI? = XTX] < XIX = IXI < XT, 


while | Xt? = XX+} < XX = [|X] < IXI. 
More in general, let A be an algebra with an involution f : At A such 
that 
(aA+6B)'=a*A'+p* BI, (AB) = BÝ At 


for all œ, 8 € C and A,B € A. Let A be complete with respect to a norm 
|| -|| : A= Ry such that 


la All = |a] lAl; A+ Bll < HAL + IBI; TABI Ss TANI 


and || A|| = 0 — > A = 0 for alla € C and A,B € A. If the norm further 
satisfes (5.6) it is called a C* norm. 


Definition 5.2.1. Any Banach *-algebra A with respect to a C* norm is 
called a C* algebra. A is called unital if it possesses an identity 1 such that 
AI1=1A=A forall AEA. 


Examples 5.2.1. 


1. The commutative algebras C(¥), respectively L? (X) of continuous, re- 
spectively essentially bounded functions over a compact phase-space ¥ 
discussed in Section 2.2.1 are C* algebras with respect to the uniform, 
respectively essentially bounded norms. 

2. Many instances of quantum systems are N-level systems; their Hilbert 
space is finite dimensional and thus can be taken as H = C^, while 
their observables are Hermitian N x N matrices with complex entries. In 
such cases, the C* algebra of bounded operators B(H) is the full matrix 
algebra My(C). Given an ONB {Wj}, in CY, set Ei := |Wi)(Y; |, 
i,j =1,...,N; then, 


N 
Eal) = (Gv) |4), EijEre = Opn Eu , b=; (5.12) 


i=1 
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Any set of matrices with these properties constitutes a set of matrix units, 
E;; being a complete set of orthogonal projections. Any X € My(C) can 
be thus expressed as a linear combination of the matrix units: 


N N 
X= X By X By = D> Xy Ey. 
ij=l ij=l 


In the standard representation where the basis vectors have the form 
[¥,)=(0 O - 1 = 0 0), 


with 1 in the j-th entry, the matrix units K;ij are N x N matrices whose 
entries are all 0, but for the ij-th one which is equal to 1. 

. A typical scenario often encountered in quantum physics is as fol- 
lows: a quantum system described by a generic (not necessarily finite- 
dimensional) Hilbert space H is coupled to an N-level system, the corre- 
sponding algebra being the tensor product Mj (B(H)) := My(C) ® B(H) 
consisting of operators of the form 


Xu © Xin 
N : éd : 
X=) ROX G= | © es = [Xj] , (5.13) 
i,j=l : Aare ; 
Xni © XNN 


where X;; are operators in B(H) and £;,; are matrix units in standard 
form. The tensor product Myn (B(H)) is a x-algebra of operators acting 
on the Hilbert space H := CY @ H consisting of vectors 


oo N | v1) 
Ha|lvy=S lth@ldiy=] : J. (5.14) 
(= [YN ) 


A uniform norm on My (B(H)) is defined by 


l? = sup (4| XTX |b) 


lel =1 
N N 
=A D> l A lds): Do = e 
i, j,k=1 i=1 


Indeed, it turns out that it satisfies (5.6); beside, My (B(H)) is a complete 
*-algebra with respect to it, thence a C* algebra. 

. Given X,Y € B(H), B(H) > [X, ,Y] := XY —YX denotes their commu- 
tator. Let V C B(H) be a linear self-adjoint subset, that is it contains the 
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adjoint of any of its elements. The commutant of V, denoted by V’, con- 
sists of all bounded operators that commute with all X e V. If X’ € Vy’ 
also (X’)' € VW’; further, if X’,Y’ € V’, then 


[X’Y’, X] =X’ [Y’, X] + [X’, X]y’=0 vxev. 


Therefore, X'Y’ € V’ and V’ is a *-algebra; also, if a sequence of X}, € V’ 
uniformly converges to X € B(H), then, 


LX, XVI] = IX- Xr, XVI S 2X" — AAI 


implies that V’ is uniformly closed and thus a C* subalgebra of B(H). 

5. If A C B(H) is a C* subalgebra with commutant A’, the center of A, 
Za := ANA’, contains all those A € A that commute among themselves 
and is thus an Abelian C* subalgebra of B(H). 


Given a bounded operator A in a unital C* algebra A, a — A is invertible 
in A (a stands for a1) if there exists B := (a—B)~' € A for which (a—A)B = 
B(a—A) = 1. The set of such a € C is called the resolvent set of A (Res(A)). 
Its complement is the spectrum of A (Sp(A)). 


Examples 5.2.2. [64] 


1. Let A € A and a € C such that ||A|| < |a|, then by Taylor expansion 
1 1 /(A\" 
a-A a 3 ( a ) 
the series converges in norm and gives rise to a well-defined operator in A. 
Therefore, Sp(A) is contained in the subset of a € C such that |a| < ||A|]|. 


Furthermore, if ag € Res(A) so that (a — A)~! exists in A, choose a € C 
such that |a — ao| < ||(ao — A)~1]|; then, 


Lo 1 a. - ag—a\” 
a—-A  (a—a9)+a9-A ag9—-A ao— A 


exists as well, whence Res(A) is an open subset of C and Sp(A) a closed 
subset of C. 

2. For a,b € C and A € A, a— (b — A) is invertible if and only if (b—a)— A 
is invertible. Thus Sp(b — A) = b — Sp( A). 

3. Fora € Cand A € A; a—A is invertible if and only if a* — At is invertible, 
whence Sp(A‘) = Sp(A)*, the conjugate set of Sp(A). 

4. If A is invertible, using A~! one writes 


a—A=aA(A'-a"), atA =a ta l(A-a). 


ND 


9. 
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Therefore, if a— A is invertible, then a~'— AT! turns out to be invertible 
too and vice versa; therefore, Sp(A~!) = (Sp(A))~?, the set consisting 
of the inverse of each element of Sp(A~!) (notice that 0 ¢ Sp(A) and 
a € Sp(A) = |a|} < || ATH] < +00). 


. The spectral radius R(A) of A € A is [64] 


R(A) := sup{|A| : à € Sp(A)} = lim |A”. 


An operator A € A is normal if At A = A AT; then, R(A) = || A||. Indeed, 
the C* properties of the norm yield 


AP"? = |](AN)?” 4?" = [ICAP A)?" I] = (AEA)? (ah A)? 
= \|(AT A)T |]? = || At All?” = Jaz” , canine 
R(A) = i P = [1A 


. Self-adjoint A > A = Al are normal and hence R(A) = |All. 
. If U is unitary (Ut U = U Ut = 1) or isometric (Ut U = 1), then 


|r|? = (OT) U” = ty tote = uty | 
= |U U| = ||| =1. 


Therefore, Sp(U) is contained within the unit circle {x € C : |z|| < 1}. 
On the other hand, if U is invertible, U~' = Ut and the preceding point 
3 implies that Sp(U) = {z E€ C : |z| = 1}. 

If A5 A= Al and |a|~! > |All, then from point 1 above one deduces 
that —iļa| 7t — A = —i|a|(1 — iļa| A) is invertible so that 


ASU := (1+ iJa|A)(1 — ija] A)! 
is a well defined unitary operator; moreover, the last point ensures that 


1 — ijalz 2ila| . zi 

U = A 14 A 
1+ iļaļz Tae eisai) 
—_—” 


w 


is invertible whenever |w| # 1, namely whenever S(z) 4 0. Therefore, 
A — z is invertible and Sp(A) C [-||Al|, | A||] because of points 3 and 6. 
Let P(z) be a polynomial of degree n on C, A € A and for a € C write 


P(z)-a=a]](z-a), a,a,EC 
i=1 


P(A) -a =a] [(4- a) ; 
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The operators A — a; commute, thus P(A) — a is not invertible if and 
only if at least one of them is not invertible, that is a € Sp(P(A)) if 
and only if at least one a; € Sp(A). Since P(a;) = a, it follows that 
Sp(P(A)) = P(Sp(A)), the set of values attained by P(z) on Sp(A). 

10. Suppose A > A = Al, from the previous result and point 8 it turns out 
that Sp(A*) = (Sp(A)*) € [0, || All?]. 


Remark 5.2.1. If A € A is self-adjoint, then by the density of the polyno- 
mials in the commutative C* algebra of continuous functions f over R, one 
can extend Example 5.2.2.8 to f(Sp(A)) = Sp(f(A)). This is the spectral 
mapping theorem |64, 324]. 


5.2.1 Positive Operators 


Particularly important bounded self-adjoint operators are the positive ones, 
that is those whose spectrum consists of non-negative values; from a physical 
point of view, they represent observables that, when measured, always returns 
a positive outcome. 


Definition 5.2.2. An operator A of a unital C* algebra A is positive (A > 0) 
if A = At and Sp(A) C R}. Given A,B € A, one sets A > B whenever 
A-B>0. 


Remark 5.2.2. Positive operators A 5 A > 0 are characterized by being 
of the form A = BiB, for some B € A and by having a unique positive 
square-root VA such that A = VAVA [64] (see also Example 5.3.4.2). 
When A = B(H), the positivity of a self-adjoint operator X € B(H) 
amounts to (Y| X4) > 0 for all y € H, which corresponds to the positivity 
of all its eigenvalues x; E€ R such that (X — 2;)|~) = 0 for some |) € H. 
Denote by |X| := VX? X the unique square-root of the positive operator 
Xt X. The map V : Ran(|X|) + Ran(X) defined by V|X||w) = X| y), 
where Ran(X) denotes the range of X € B(H), ! is a partial isometry, 


IVIXI YMI? = (eb | XEX |W) = MT DIP 


Let U denote the partial isometry which equals the extension of V on the 
closure of Ran(X) and 0 on Ker(|X|) = (Ran(|X|))+, then X = U|X| is the 
so-called polar decomposition of X [64]. It is unique; namely, if X = VB with 
B > 0 and V is a partial isometry with V = 0 on Ker(B), then 


"The range of X € B(H) is the linear subset of vectors of the form |) = X|¢) 
for some ¢ € H. Ran(X') L Ker(X), where Ker(X) is the kernel of X that is the 
closed subspace of vectors w € H such that X|w) = 0. 
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X'X = BV'IVB = B’? = B=|X| 


by the uniqueness of the square-root; further, U = V for both annihilate 
Ran(|X|)+. The projection p := U'U is called the initial projection of U, 
while q = UU' its final projection. 

In the simplest cases B(H) = My (C), then |X| = Vv XtX can be spectral- 
ized, |X| = $X; zil V; )( Y |. The eigenvalues z; > 0 of |X| are the so-called 
singular values of X, while its eigenvectors Y; form an ONB in C™. Using the 
polar decomposition, it turns out that any matrix can always be represented 
in terms of its singular values and of two, generally different, ONBs , 


N 
X =U|X|= X z|], |8;):= Ul). (5.16) 
j=l 


Also, if V is the unitary matrix that diagonalizes the Hermitian matrix |X], 
|X|=VDV', then X = W DV' with W := UV unitary. 


Examples 5.2.3. [10, 296] 


1. From Example 5.2.2.10 it turns out that the spectrum of the positive 
elements A > 0 of a C* algebra A is such that Sp(A) C (0, || All]. 

2. Suppose A > B > 0 for A,B € A; then, from the previous remark, 
A — B = CÏ C, whence 


D(A — B)D = (CDY (CD) > 0 = D AD > D'BD, 


for all D € A. A typical situation is when A = P, an orthogonal projec- 
tion which is always < 1, then Dİ PD < DİD. 

3. Let B(H) > X := Py — Py = |Y) Y| = [$)$], Y # $ € H. One can 
always write 


|) =al) +617) (d] oo) =0, a:=(¥/6), 8=vVy1- lal . 
Then, on the subspace K spanned by w and w+, X is represented by the 


2 x 2 matrix 2 
{B —Ba 
x= (&, 8). 


Thus X has eigenvalues +6 and eigenprojectors 


= 1+ sa 148 
Jt) =f Eer = 5) |v), 


where et? is the phase of a. Therefore, 


X=AIl+)(+1-l-\M-)> |X1=6Qx, 


where Qg = |+)(+|+ |—)(—| projects onto K. Further, U = 671X is 
an isometry on K that vanishes on K+. 
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4. If X =U |X| then Xt = |X| Ut whence 
KM SU XPOS UAI XU. 


Therefore, X' X and X X7 have the same eigenvalues with the same 
degeneracies apart, possibly, from the eigenvalue 0. 

5. Suppose 0 < X < Y, X,Y € B(H), then Y! < X~1. Indeed, X~! and 
Y—! exist; thus one can set Z := Y~!/2XY—!/2, Then, Z < 1. In fact, 
(see Definition 5.2.2) for all y € H 


(b|YV2XY T? jp) = (Y4 X Y7) 
< (Yy Y Yy) = (Yip). 


By the same argument, multiplication of both sides of the inequality 
Z < 1 first by Z-! = Y1/2X—-1y1/2 yields 1 < Y1/2X—1Y1/2, one then 
multiplies both sides of this inequality by Y~!/?. 

6. Let X < Y € B(H) be such that log X exists (see Remark 5.2.1 and 
Remark 5.3.4), then log X < log Y. This follows from the spectral calculus 
and the previous point, for t+ X <t+/Y for all t > 0 and the fact that 


+oo +00 
1 1 
log = = f dt -f d— , vey >00. 
y 0 t+ 0 t+y 


ae 1 i 1 
Then, log X —log¥ = f a - | dt —— <0. 
ó t+X t+Y 
7. Consider the setting of Example 5.2.1.3, that is the C* algebra My (B(H)); 


X > 0 if and only if (| X |b) = Eija (Yi | Xu bj) 2 0 for all ġ € F. 
By arbitrarily choosing Y; € B(H), ¢ € H and setting | Yi) := Y;| @), it 


follows that X > 0 if and only if 


Xu c+) Xin Yı 
oae e p : |20 
ij=1 Xni >e XNN Yn 


for all Y; € B(H). 
8. Also, X > 0 if and only if X = YY, Y € My(B(H)). Then, 


n N 
x= E yi Be YfYre = X | X Ep8 Yh Yre 
T 


k=1 \j,e=1 


Therefore, X > 0 if and only if it is a sum of matrices of the form [Y,'Y;], 
Y; € B(H). 

9. Consider the N x N matrix E := [Ei;] E€ Mn(Mn(C)) whose entries are 
matrix units Ej; € My(C). According to the previous point, BS 0s 
indeed, 


5.2 C* Algebras 151 


N N 
E= X Bj 8 By = Ņ_ Bi Q EļErj VRS L2 ea Na 
ij=l i=l 


Let {|i)}4, be the ONB such that Ej; = |i i) 5 |; then, E turns out to 
be proportional to the orthogonal projector PN 


1 = 
= F7) M = = an HGl@ltMG|= gE (5.17) 


rm 


onto the totally symmetric state 


CY @CN 5 WX) := al . (5.18) 


Finite Dimensional algebras 


A C* algebra A is finite dimensional if its dimension as a linear space is finite; 
as such it has an identity 1. In particular, its center Z4 (see Example 5.2.1.5) 
is a finite dimensional algebra whose elements all commute: such an algebra 
is called Abelian. It is generated by minimal projections {P;}"_, (see Exam- 
ple 5.3.4). Due to their orthogonality, A = @j_, Ai, where A; := A P;, with 
P; its identity operator, whence their centers are trivial and the A; simple 
algebras. In fact, A; cannot contain any non trivial ideal i C A, for, like 
A, also i has an identity E [296]; then, XE € i => EXE = XE s0 that 
E commutes with all self-adjoint X € A; and thus belongs to its center: 
EX=(XEÝ=EXE=XE. 

We shall set A = A; and show that it is isomorphic to a matrix algebra 
by constructing an appropriate system of matrix units {E;, itt jar 

Let B C A be a maximally Abelian subalgebra with minimal projectors 
{Q} such that QjQk = ĝjkQ; and De Q; = 1. For each Q;, one can 
always choose X; € A such that Y; := Q;X,;Q1 # 0; indeed, for all X € A, 
ix = {) 1 X: XY; : Xı,Y;i € A} is an ideal of A which, as A is simple, 
must coincide with it. 

Observe that YY; = Q1.X!Q;XjQ1 and Y;Y} = Q;X;Q1X/Q; commute 
with B; since B is maximally Abelian, they belong to it. Thus, Y/Y; =A;Qi 
and yy) = ujQ;. Further, A; = py > 0, for IW; Y; = PAA By setting 
Z; = Yj/ Aj and Fij := ZZ} it follows that Z'Z; = Qi, while Ej; = 
Z; 23 = Q; for all j. Moreover, 


QiXiQ1X1QjQpXpQi X1Q, 
<n 


Fij pg = 
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ty. 
YY; Yj 


——— E 
_ z, 2X101 Xj WXGQ1 OXI Qa 
ve J MiAg Aj 


Thus, the /;; are the required set of matrix units as they ae span A: for 
all X € A, ZÌ X Zj = (Qi. X!1Qi X Qj XjQ1)/./MAj = mi(X) Q1, whence 


Yı 


= jp Eiq i 


25 Ooi S zz! X 2,2} = T u )Z:Q:Z} 
í j=l 4,g=1 i,j=1 
d 
= Ñ m(X)E; 
ij=1 


Therefore, any finite dimensional C* algebra is isomorphic to the orthogonal 
sum of full matrix algebras: A ~ @;_; Ma, (©). 


Compact Operators 


If H is infinite dimensional, the matricial structure of My (C) carries over to 
the so-called compact operators, Bao (H). These are all X € B(H) such that 
|X| has a discrete spectrum of finitely degenerate eigenvalues that accumulate 
to 0, the only eigenvalue with possibly infinite degeneracy 7. It turns out that 
these spectral properties are preserved by linear combinations and operator 
multiplication [251, 270]. 

Practically speaking, compact operators are obtained by closing with re- 
spect to the uniform norm the *-algebra of finite rank operators, that is of 
the linear span of all possible X on H that are non-zero on finite dimensional 
subspaces, only, where they can be represented as usual matrices. As such the 
algebra of compact operators is a Banach *-algebra without identity operator 
for the only eigenvalue of 1 is infinitely degenerate. 


Trace-Class Operators 


Consider a matrix algebra My (C), the functional Tr : My (C) > C, 


N 
My(C) 3 X > Tr(X) = X (4| X |) , (5.19) 


i=1 
where {Wy} joa is any ONB in C%, defines a so-called trace on My(C). 


? The simplest example of compact operator is any projector P = | Y }( 4 | which 
vanishes on the orthogonal complement of p, whence its zero eigenvalues is infinitely 
degenerate when H is infinite dimensional. 
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The trace is basis-independent; indeed, because of (5.2); given any two 
ONBs {8}; Pii and {% H], 


N N 
S (9; | X |; ) =% 4 (Bj | Wie )( Vic | X [Pe )( Ye | P; ) 
yæl j,k l=1 


N N N 
D> (Z(E AD; |e) (Ge LX We) = SG |X We) 
k=1 


i] 


ci 
os 
Il 
m 
& 


Ske 


The trace of a matrix amounts to the sum of its diagonal entries, so Tr(X) > 0 
if X > 0. Further, it is cyclic; namely, for all X,Y € My(C), (5.2) yields 


N N 
Tr(XY) = X (2: | XY |) = 2 GAGE) 
r ‘j= 
= X (5Y |i) (Gil XW) = TY X) . (5.20) 


i, j=1 


Using the trace, one constructs the following map from My (C) onto R+, 
N 
[ella : X= Xa = THX] = Soa, (5.21) 


where xy are the singular values of X (see (5.16)). This map vanishes only if 
X = 0; also, from (5.4) and (5.16) it follows that 


N 
[TV X)| < $ ai\(V| YU %)| < VX (5.22) 
i=1 
|TrX] = |Tr(U|X])| < IX lh (5.23) 
IX + Z|]. = THUN(X + Z)) < Xn + IZh - (5.24) 
Therefore, ||- ||; is a norm on My(C) called trace-norm. 


If extended to B(H) with H infinite dimensional, the trace selects the 
linear subspace Bı (H) C B(H) of trace-class operators: 


B (H) := {x €B(H): Xli < oof 


If X € B,(H) then, by the polar decomposition X = }7,, &n| On ){ Wn |, where 
{dn} and {Yn} are two ONB in H, «x, are the eigenvalues of |X| = VXTX 
and the sum converges in trace-norm; also, |X| = 07°, vi. 
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Then, inequality (5.22) holds with Y € B(H), X € B,(H), (5.23) and 
(5.24) with X, Z € B,(H). The trace-class operators thus form a *-algebra 3; 
Bi (H) is also closed with respect to the trace-norm and thus a Banach *- 
algebra, without identity in infinite dimension for Tr(1) diverges [251, 270] . 


Example 5.2.4. [64] Any X € B(H) defines a linear functional 


‘xy : B(H)OC, p= Fx(p):= Tr(X p) Vp € Bi(H) , 


on B,(H) which is bounded for |F'x(p)| < ||X|| ||p||1. Therefore, B(H) can be 
identified with a subspace of B,(H)*, the topological dual of Bı (H), that is 
the (Banach) space consisting of all continuous linear functionals on B; (H). 
Actually, B(H) = Bı (H)*. Indeed, let F € B,(H)* and consider the bounded 
operator |ġ)( Y| with ¢, y € H not necessarily normalized. It is also trace- 
class; indeed, set Py :=|w)(w|/||w||?; then, 


Io )(a Ila = T (yele II? Py) = NI HI 


It thus follows that |F'(|¢)(w])| < | FI| |||] ||| for all ¢,~ € H. Therefore, 
each F € B,(H)* defines a so-called sesquilinear form on H x H, linear in 
the first argument, antilinear in the second one and continuous with respect 
to both. Consequently, there exists an unique operator Xp € B(H) such 
that F(|¢)(v|) = (| Xe |d) for all 4,7 € H. Before proving this fact, 
we draw the conclusion; as already noticed, any p € B; (H) can be written as 
P = >>, Tn| On ){ Yn | with the possibly infinite sum converging in trace-norm; 
thus, B,(H)* C B(H) for 


=X rn Flon) (Pnl) = 5 rn (Yn | XF ln) = Tr(Xr p) . 


n n 


The property of sesquilinear forms used above comes as follows: if f : Ht C 
is a continuous linear functional on H, that is | f(¢)| < || f\] |/w||, then Ker(f) 
is closed. Assume Ker(f) 4 H; if ¢ € Ker(f)+, ||¢|| = 1, then f(¢) 4 0 and 


Ker(f) 3 |x) = FAY) — FL) >) => F) = (plt), 


where |$) := f(@)|¢@). It is easily seen that this vector is unique and that 
\| fll = |f()|. Given a continuous sesquilinear form f : H x H+ C, for each 
fixed Y € H it defines a continuous linear functional fy : H ++ H; therefore, 
there exists a unique | Xy) € H such that f(ġ, Y) = fyl) = (Xy |ġ). This 
allows to define a linear operator x} € B(H) such that x} w)=|xy) and, 
whence f(¢,») = (4| X; lo). 


3B, (H) is a two-sided ideal, namely Y X,Y X € Bi (H) whenever X € Bı (H) and 
Y € B(H). 


5.2 C* Algebras 155 


Remark 5.2.3. As B(H) is the dual of B,(H) it can be equipped with the 
corresponding w*-topology (see Remark 5.1.1.5); namely, any p € B,(H) 
defines a linear functional B(H) > X + E,(X) := Tr(Xp). The wx-topology 
on B(H) is the coarsest one with respect to which all semi-norms £,(X) := 
|Tr(p X)| are continuous. By comparing these semi-norms with those in (5.9), 
it turns out that the w* topology coincides with the o-weak topology. 


Hilbert-Schmidt Operators 


A second norm on My(C), also based on the trace, is given by 


l-le : X> |X] := VTX? = 


(5.25) 


It is called Hilbert-Schmidt norm; unlike the trace-norm, it originates from a 
(Hilbert-Schmidt) scalar product 


My (C) x My (©) > (Y, X) = Te(YŻ X). (5.26) 


that satisfies |Tr(Y'X)| < ||Y||2||X Jz. In fact, using (5.16) and (5.1), 


N N N 
THX) s DEA ARAA ENDED Aaa 
N 
< Xle, SB | YIU YG UYY |B) < [Xle [I¥ll2, (5-27) 
j=1 


for U| W; ) (W; |Ut < 1. When defined on B(H) with H infinite-dimensional, 
the Hilbert-Schmidt norm singles out the linear subspace B2 (H) of Hilbert- 
Schmidt operators 


Bo(H) := {xe (H): IXllz < co} . 


If X € Ba(H), |X] = VX Z: 27, with z; the singular values of X. Then, 
inequality (5.27) holds with X,Y € Bo(H), respectively Y € B(H), X € 
B2 (H). B2 (H) is also close with respect to ||: ||2 and thus a Banach *-subalgebra 
of B(H) (actually also a two-sided ideal as B,(H)) without identity in the 
infinite dimensional case for || Illy diverges [251, 270]. 


Example 5.2.5. Let Fj, j = 1,2,...,N*, be a set of N x N matrices, orthog- 
onal with respect to (5.26), Tr(F! Fk) = jx: they form an ONB in My(C). 
Indeed, as a Hilbert space equipped with (5.26), Myn (C) has dimension N?, 
therefore, for all X € My(C), 
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N2 
My(C) 3X =)° T(X FI) F}. (5.28) 
j=l 
Consider the linear map Try : My(C)> My(C) defined by 
My(C) > X > Trn[X] := Tr(X) 1x . (5.29) 


We shall refer to it as trace map. Choose an ONB {|a)}4_, in CX; the 

N? matrix units Eag := |a)(3| also form an ONB in My (C). Thus, dhere 
2 

must exist a unitary matrix U € My2(C) such that Eag = D Vapi F; 

Therefore, the trace-map can be recast as 


Try[X] = 3 Ega X Eag = S s Ute. Uag j F] XF; 


a,ß=1 j= ae ae 


N? 
\OFÌXF,. (5.30) 


i=l 


Remark 5.2.4. The uniform, trace and Hilbert-Schmidt norms are all equiv- 
alent on finite-dimensional H and thus define equivalent topologies with the 
same converging sequences. Indeed, given X € My(C) its norm coincides 
with its largest singular values, ||X|| = maxi<i<n zi, then 


IX SUX, Xh SN IXI IX < Xle; Xl < VN IXI - 


Also, ||X|]2 < ||X||, for the sum of squares of positive numbers is smaller 
that the square of their sum; while from (5.1), 


N 
|X| =S z: < 
i=l 


However, the trace and Hilbert-Schmidt norms are not C* norms; indeed, for 
any NEN, 


XTX |i = 


N N 
Slat AY ci, |XX] = 
j=l i=l 


Therefore, the trace-class and Hilbert-Schmidt operators form Banach *- 
algebras but not C* algebras. 
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5.2.2 Positive and Completely Positive Maps 


Physical transformations of quantum systems are described by linear maps 
acting either on their observables or on their states. As already seen in the 
classical case, these two possibilities are dual to each other; in the first case 
states are not affected, in the second one, states change while the observables 
do not. The physically relevant request is that mean values do not depend 
on which of the two ways they are calculated. 

In the classical setting, states must change while preserving their ultimate 
characteristic of being probability distributions; the maps which describe clas- 
sical state transformations must thus be positivity preserving. In quantum 
mechanics things are more complicated and intriguing; it is indeed neces- 
sary to sharpen the notion of positive linear transformation. This latter is as 
follows. 


Definition 5.2.1 (Positive Maps). A linear map A : B(H) ~ B(H) is 
positive if and only if BUH) > X > 0 => B(H) 5 A[X] > 0. 


Given a positive linear map A, one can always lift it to act on the operator- 
valued algebras My (B(H)) as 


A[X] gi A[X1n] 
A[Xni] «+: ALXwn] 
One may then ask whether idy @ A is positive, too. 


Definition 5.2.2 (Completely Positive Maps). 
A linear map A: B(H) + B(H) is N-positive if and only if 


idy ® A: My(B(H)) + Mn (B(H)) 


is positive; A is completely positive (CP) if and only if it is N-positive for all 
N. A linear map A: B(H) + B(H) is called a CPU map when it is CP and 
also unital, that is AJI] = 1. 


Examples 5.2.6. 


1. Positive maps are hermiticity-preserving, for A[X'] = A[X]'. Indeed, 
any X € B(H) can be decomposed into Hermitian components, X = 
X+Xt X-Xİ 


5 +i z | In turn, X1,2 can be decomposed into positive com- 
ie 
Xı X2 
+ X E X 
ponents X1, 2 = Xt — X19, Xiz = eoi Thus, 


A[X]" = AXT] - AXT] — ¢ AXP] + i AX] = AXi -i Xa] = AX] 
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Positivity corresponds to 1-positivity; complete positivity is stronger for 
there are positive maps which are not 2-positive, a renown example being 
transposition Tə : M2(C) + Mo(C), Te[X] = XT, with respect to a 
fixed ONB. We shall consider the N-dimensional case: Ty is positive for 
transposition does not affect the spectrum of a matrix, but the partial 
transposition idy ® Ty is not positive, whence Ty is not N-positive 
and thus not CP . This can be seen by considering the positive matrix 
E = [E;;] of Example 5.2.3.9. Then, 


N 
V := idy @ Ty[E] = Nidy 8 Tw[PX] = > iX ilg liil (6.32) 


acts as a flip operator on CN @ CN, that is V(Iv) @1¢)) =|¢)@|y). 


Since V has eigenvalue 1 on the N(N + 1)/2 dimensional subspace of 
symmetric states and —1 on the N(N — 1)/2 dimensional subspace of 
anti-symmetric states, it is not positive and Ty not N-positive, hence 
not completely positive. 


. CPU maps satisfy the inequality 


A[Xt X] > ALXTALX] > 0. (5.33) 
In fact, from Example 5.2.3.8 it turns out that 
F ahe wae 0) 20 
Xt xix x) 0) 0 OF = 
Since A is CPU, it follows that 


moar ete) = Cape ate) 2° 


whence, because of the previous point, and using Example 5.2.3.7, 


(ax -0 (oe ape) (441) = ata- apai 


<L 
> 
M 
= 
E 


>0. 


. CPU maps are contractions: since B(H) 3 XX < ||X||?, positiv- 


ity, unitality and (5.33) yield A[X]tA[X] < A[XtX] < ||X||2, whence 
| ALX]|| < IXI. 


. Let B C Aa C subalgebra of a C* algebra A, both assumed with 


identity, the map 7g4 : B +> A denotes the natural embedding of B into 
A; according to Examples 5.2.3.7 and 8, embeddings are CPU maps. 
Indeed, for all A; € A, Bj € Band N EN, 


N 
XO Al wal B! By] Aj = Z'Z>0. 


i j=1 
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6. If A and B are C* algebras (with identity) and A is Abelian, then 


any positive map A: A+» B is CP. 


In order to check whether 


yy”, YI ALXTX,]Y; > 0 for all Xi; € A, Yj € Band n € N we 


4,9=1 


use that Y; ; and A[X}X,] can be identified with functions f(a) € C(¥) 


over a compact topological space ¥. Then 


(do vial AIX X, 1¥5) (2) =5 iG 


a4,g=1 t,j=1 


MIXI Xj](x)¥; (a) 


= A[Z1Z,)(x) > 0, 


where Zs := X; Yi(x) X; € A for all z € X. 

7. If A and B are C* algebras with identity and 6 is Abelian, then any 
positive map A : A > B is CP . We take 6 = B(H) and check whether 
oP jai (Yi | AXİX; |b; ) > 0 for all y; € H, n € N and X; € A identi- 
fied with functions in C(¥). By duality, AT [| Y; )( y: |] gives a complex 


measure on ¥ such that 


n 


i, j=1 i, j=1 


S i | ALXEX] Y) = S f amx (2) X;(x) > 0. 


Indeed, for all c; € C and |W) := Y; cilti), 


D ao f ams) = So (vs AU is) = (Ata) > 0 


t,g=1 gat 


In order to ascertain whether a linear map is completely positive, it seems 
necessary to check N-positivity for all N € N. Luckily, the following result [82, 


83] shows that N-positive maps A: My(C) > 


B(H) are automatically CP . 


Theorem 5.2.1 (Choi). A linear map A : My(C) — B(H) is CP if and 
only if idy Q A[E] > 0, where E is the matrix introduced in Example 5.2.3.9. 


Proof: As seen in Example 5.2.3.9, E> 0; thus the “only” if part follows 
from the fact that if A is CP it is N-positive (see Definition 5.2.2). 

As regards the “if” part, in order to check that A N-positive implies A 
CP , we choose M € N arbitrary and show that idm @ A[X] > 0 for all 
My,(C) @ B(H) > X > 0. Using Example 5.2.3.8, it is sufficient to show that 
yo _ Y{ ALX!X,]Y) > 0 for all choices of Y; € B(H) and X; € My/(C). 


i, j=1 


Then, by writing X; = Dri xt Eke € My(C), from the assumed positiv- 
ity of MEy and Example 5.2.3.7 it turns out that 
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M N M t M 
S yaxx- y (>: “ie AmS al, ¥; 
i j=1 k,lł,s=1 \i=1 j=1 


— 
Zks 


whence A is M-positive for all M € N and thus CP . 


Remark 5.2.5. The matrix X4 := idy @ A[E] € My(Mys(C)) associated 
with any linear map A: My(C) + My,(C) is known as Choi matriz. Theo- 
rem 5.2.1 can then be rephrased as: A: My(C) +> Mm(C) is CP if and only 
if its Choi matrix is positive. 

Vice versa, . X=), Xie oe i deg EQO be an NM x NM ma- 


trix, where BE ij ) and E™ denote the matrix units in M n(C), respectively 
Mm(C). The map Ax : My(C) + Mm(C) defined by linear extension of 


LJSE 


M 
N N : 
Ey ne Ax [EY | = 5 EQD Kisir 3 (5.34) 


r,s=1 


is such that its Choi matrix is X. Denoting by £(N, M) the linear space of 
linear maps A: My(C) + Mm(C), the one-to-one relation 


L(N, M) 534- XE Mym(C) 


is known as Jamiołkowski isomorphism [158]. 


If a map A: My(C) +} B(H) is only positive, its Choi matrix cannot be 
positive, but only block positive, namely only its mean values relative to 
product states in CY & H are surely non-negative. 


Proposition 5.2.1. A linear map A: My(C) + B(H) is positive if and only 
if (Y8 dlidn 8 A[E] |W @ ¢) > 0 for all yp € CX and ¢ € H. 


Proof: Positive matrices can be written as sums of projectors with positive 
coefficients, thus, according to Definition 5.2.1, A : My (C) +> B(H) is positive 
if and only if (¢| Ally) y |] |¢) > 0 for all 6 € Hand y € C™. But, 


(ol Allylle) = Sro ) ($ | AlE] [G2 ) 


tacl 


N 
=(4* 94| D> Ey 8 A[Ey] ly* @ 4) 


ij=1 


= (4* 9 ¢ | idy 8 A[E] Y* @ 4) , 
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where | ~*) = ys w*(i)| 7) with respect to the fixed ONB in C such that 
Eig = |t) l. 
In finite dimension the structure of CP maps can be made explicit by 


means of the Hilbert-structure inherited by My (C) from the Hilbert-Schmidt 
scalar product (5.26). 


Examples 5.2.7. 


1. Consider the linear space L(N, N) of linear maps A: My(C) —> My(C); 
it can be given a Hilbert space structure by means of the Hilbert-Schmidt 
scalar product of their Choi matrices, 


<< Ay|Ag >>:= Tr(idy @ A,[E}t idn @ Aa{E]) 


Consider an ONB {F;}®2, in My(C) and the maps 8i; € £L(N, N), de- 
fined by Pij [X] := Fİ X Fj. They satisfy << Gj; | Dye >>= 5in5j¢ and 
therefore form an ONB in L(N, N), whence 


N2 
A= 5 Li Fi X F; ; Li; =<< Piz | A>>= Tr(F! A[F;]) 5 (5.35) 


ij=1 


for all A € L(N, N). If A preserves hermiticity, the N? x N? matrix of co- 
2 

efficients A;; is Hermitian and can be diagonalized, Li; = ae lk Vi Vkj- 

Suppose the eigenvalues ¢; are positive, then 


N? N2 
AX] =S°GLXG., Gri= Vle X Veg F}. (5.36) 
k=1 


j=1 


Using Example 5.2.3.8, maps I € L(N, N) of the form T|X] = G XG 
are easily proved to be CP . Therefore, linear maps A: My (C) + My (C) 
of the form (5.36) are CP and CPU if >’_, GG, = 1y. Notice that the 
decomposition (5.36) is highly non-unique; another possible decomposi- 
tion is indeed provided by (5.35) if the matrix [L,,] is positive. 

2. Let Try : Mn(C) + My(C) be the trace map of Example 5.2.5 and 
consider the reduction map [151] A: Myn (C) — My(C), 


A[X] = Ty xia (5.37) 


A is positive, but not CP ; positivity follows since, if X > 0, Try [X] is not 
smaller than any of the eigenvalues of X. On the other hand, using (5.17), 
the Choi matrix of A turns out to be idy @ ALE] = Íy: — NPN, it has 
a negative eigenvalue 1 — N, whence A cannot be CP. 
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3. The transposition Ty : My(C) + Myn(C) is the paradigm of a map 
which is positive but not CP (as it is not N-positive); by combining Ty 
with A in (5.37), A := Ao Ty : My(C) > My(C) is CP . Indeed, 
using (5.32), idy @ A|E] = idy @ A[V] = Iy2 —V > 0, for the eigenvalues 
of the flip operator are +1. 


The next two results [287, 180] show that the CP maps are completely char- 
acterized by a structure as in (5.36). 


Theorem 5.2.2 (Stinespring Dilation). A unital map A: B(H) + B(K) 
is CPU if and only if there exists a triplet (Ky, 7a(B(R)), Va), where Ka 
is a Hilbert space, Va : K — Ky an isometry and 7, : BUH) — B(Ka) a 
representation of B(H) on Ky such that 


A[X] = VÍ wa(X) Va . (5.38) 


The triplet (Ka, na (B(K)), Va) is unique up to unitary equivalences. 


Proof: If A has the form (5.38) with VIVA = Ik, Example 5.2.3.8 shows 
that Ais CPU. To prove the converse, consider the linear span of all elements 
of the form X @ y, X € B(H) and w € H, 


and the bilinear form (-|-) 4:8 x 8+ C defined by 


(Xoy, Y Od) (X@V|YOo)a=(PlAIX'Y]|6). (5.39) 
If A is CP , this bilinear form is positive on B x B (see Example 5.2.3.7), 
N N N 
(X Xi @ bi] X Xa = X (di | AXX vy) 
i=1 j=l ij=l 


N 
=(4| X Ey 8AE) 20 VNEN. 
ij=1 

By considering the quotient of B by the kernel of the bilinear form, (5.39) 
gives a scalar product on the linear span B4 := B/Kern((-|-}4) of the 
equivalence classes [X @ w] (see the discussion after Definition 5.3.5). Set Ky 
equal to the closure of B4 with respect to the scalar product (-|-), and let 
ma represent B(H) on Ky by ma(X)[Y 8 ¢] = [XY 8 ¢]. Then, the linear 
maps Va : K+ Ky, and vi :Ka |e K, 


Valo) =[1@¢), Vi[X@¢]=A[X]l¢), 


define an isometry (A is CPU) and V/1(X)Val|¢) = VIIX @ ¢] = A[X]| 4) 
for all ọ € H. 
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Example 5.2.8. That CPU maps are contractions (see Example 5.2.6.4) 
comes easily from Stinespring dilation. In fact, V, is an isometry, thus 
VaVi < 1, whence 


A[XtX] = Vi ral Xtra X] Va > Viral Xt] Va Vira [X] Va = A[X] ALX] . 


Proposition 5.2.1 (Kraus Representation). A: B(H) + B(K)) is 
CPU if and only if it admits a Kraus representation of the form 


=X exe. (5.40) 
J 


where the Kraus operators G; : K +> H are such that, if infinite, the sum 
converges in the strong-operator topology. 


Proof: The Stinespring representation is of the form B(H) & lg on HQ K, 


for a finite dimensional or countably infinite Hilbert space K. seep an ONB 
{| j)} in K, the isometry V4: K > H® K and its adjoint Vi: H@KHK 
read 


Vale) =) Gil) 813); — VALWOS)D lA) Gily), 


J 


with Gj: K > H and X; G!G; = Ik. 


Remarks 5.2.6. 


1. Using (5.36), the composition of CPU maps results in a CPU map. Indeed, 
let 412 : B(H1) > B(H2), 4121X] = Z; G3 (4) X Gras), X € BCH), and 
Ao : B(H2) + B(H3), Aos[Y] = 3>, G} (k) Y G23(k), Y € B(Ho), then, 


A23 © A12[X] = X` G} (Jk) X Gis (jk) , 
j,k 


with new Kraus operators Gi3(jk) := Gi2(j)Go3(k). 
2. Let A: B(H) + B(H) be CPU and T the transposition with respect to a 
fixed ONB in H; while AoT need not be CPU, To AoT surely is; indeed, 


To AoT[X] = 5; T|} [XT]G |= D GT XG, with T[X] =: XT, 
X € B(H), the transposed of X and X* := (X7)” its conjugate. 
3. In full generality, a CPU map A: B(H) + B(K) has the form 


=) Cyl X Lj, 


ig 


with Jij OIL; = Ix and C = [C;;] a positive matrix, from which 
the diagonal Kraus representation is achieved by diagonalization as in 
Example 5.2.7. 
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While the structure of completely positive maps is fully under control, it 
is not so for positive maps which are still somewhat elusive. For instance, 
if the coefficients matrix C = [C;,;] is Hermitian but not positive, by 
grouping together its positive and negative eigenvalues, ck, A can always 


be written as the difference of two CP maps, 


AX] = So cn GE XG. -— SO lerl GE XG: . (5.41) 


ch20 Ck <0 


For instance, let {F;}M be a Hilbert-Schmidt ONB in My(C) with 
F, = 1y/N; using (5.30), the reduction map in Example 5.2.7.2, which 


1 
is positive, but not CP , reads A[X] = (= z 1) X4+ OFX AK, 


N2 
1=2 


If there are no negative cz, then A is completely positive; if not, no general 
rule exists to deduce from the cg whether A is a positive map. 


Conditional Expectations 


Particularly important CPU maps are the so-called conditional expectations 
which are the non-commutative counterparts of the Radon-Nikodym deriva- 


tive in (2.51). 


Definition 5.2.3. /117] A positive, unital linear map E: At B C A where 
A and B are C* algebras with identity is a conditional expectation of A onto 


B if 


[AB] = 


[AJB for all A€ A and BEB. 


Proposition 5.2.2. Conditional expectations enjoy the following properties: 


AJ} =E[AT] VAEA (5.42) 
BA] = BE[A] VAeEA, BEB (5.43) 
oE =E (5.44) 
Al A] > EJA] EJA] VAEA (5.45) 
||E|| =1. (5.46) 
Further, E is a CPU map. 
Proof: Property (5.42) comes from positivity as in Example 5.2.6.1; prop- 


erty (5.43) is a consequence of (5.42): 


Property (5.44) follows from E 


[BA] = E[(BA)t]t = EJA Bt]t = (EJAt] Bt)! = BEJA] . 


for all A € A; in order to prove 


1E[A]] = E[A 


property (5.45) consider A — E[A] and use positivity and (5.43), then 
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0 < E[(A — E[A])'(A — E[A])] = E[Al A] — E[A]' E[A] . 


Property (5.46) results from the previous property and positivity: 


Al A < I Al? = EJA] E[A] < E[At A] < IA]? 


Complete positivity is a consequence of (5.43) which yields 


(|B, ABs] = Bı (| A] B2 y Bi 2 EB, AEA. 


Therefore, from Examples 5.2.3.7 and 8, 


N N 
>> BÌ EJA! Aj] B; = X E[B] Al A;B; = E[Z' Z] 20, 
ij=l ij=1 


for all B; € B, Aj € A and N €N, where Z := Yo), AjBj. 


Because of the properties 3 and 5, conditional expectations are also called 
projections of norm one. 


Remark 5.2.7. In case A is a von Neumann algebra and Ag C A a von Neu- 
mann subalgebra, one call conditional expectations all projections of norm 
one which are also normal. This latter property of linear maps A: A; +> A» 
between von Neumann algebras amounts to the following [64]. Let {A,}, 
be an increasing net of operators in A, that is a set of operators indexed 
by a set of indexes u € M equipped with a partial ordering < such that 
Hy S po => A,, < A,,. If the net {A,},, has an upper bound, then 
it has a least upper bound A € A to which the net converges strongly: 
s—lim, A, = A. Then, A is normal if for all nets {A,,},, with an upper 
bound lim, A[A,,] = Aflim,, A,,]. 


Examples 5.2.9. 


1. Let {Pi}icr € B(H) be orthogonal projections P;P; = 6;;P; such that 
Mier i = 1, then E[X] := Jier Pi X P; is a conditional expectation 
from B(H) onto the Abelian subalgebra P generated by the P;. Indeed, 


it is positive, linear and writing P 3 P= > jer PjPj, it turns out that 


[XP] = X. pP XP P= > PAP, = E[X]P . 
i, jEI iEI 


2. Consider two finite-level systems A and B described by matrix algebras 
Mn, (C), respectively Mn,(C); let Tra denote the normalized trace map 
performed with respect to party A, namely 


= 1 
Tra(X4) = z Tala) la . (5.47) 
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A linear map from the matrix algebra M,,, (C)®Mn, (C) of the compound 
system A+ B onto the subalgebra 14 ® M,,,(C) is obtained by defining 


[Xa 9 Xp] :=Tra(X4)@Xp VAEM,,(C) BEM,,(C) 


on tensor products and then by extending it by linearity and continuity 
to the whole of Mn, (C) ® M,,(C). Any 0 < X € Mn, (C) ® M,,(C) can 
be written as )>,(X4)' X4 8 ER by means of a system of matrix units 
{EB} 4-1 in Mm (C) (see Examples 5.2.3.7 and 8). Thus, one verifies 
that E is a positive linear map; indeed, 


(Yo |E[X] Wn) = X Tea((X4)' X4) (We EE |e) 


j=l 


where Y4 := D; wi, XÍ with WZ the j-th component of |Wg) € C™ 
along the ONB associated with the chosen matrix units. By writing the 
identity matrix 1448 = 7721 524 Ef @ EB where {Ef 13-1 is a sys- 
tem of matrix units in M,,,(C), one shows that E[14+4,] = 14+, whence 
z is unital. Furthermore, as any X € M,,,(C) ® Mn, (C) can be written 


in the form X = X, X4 8 X$, it turns out that 


= 


X Xp] = Y Tra(X4) ® X$Xp =E[X] Xz , 
£ 


whence E is a conditional expectation from Mn, (C) ® Mn, (C) onto 14 @ 
Mn, (C). 


5.3 von Neumann Algebras 


In this section, we consider in detail some techniques proper to von Neumann 
algebras which are C* subalgebras of B(H) that are also closed with respect 
to the strong and weak topologies [64, 293, 300]. 


Definition 5.3.1. The commutant of a C* algebra A C B(H) is the C* al- 
gebra B(H) 2 A’ := Ci € B(H) : [X’, X}=0VXeE A}, the bicommutant 


the 


C* algebra B(H) 3 A” := {x" € B(H) : [X”, X']=0YX' € A. 


Remark 5.3.1. Beside being C* algebras, commutants and bicommutants 


are 


also closed with respect to both the strong and weak topology. Indeed, if 


XE 
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= (s,w) — limpo X/, with [X/,, X] = 0 for all X € A, then, because of 


the continuity of the scalar product, 


(WIX, X] |b) = lim (YIX, Xa]le) =0 


for all y, € H, whence X’ € A’. Therefore, A’ and A” contain all those 
operators that can be constructed from operators in A via strong-limits and 
weak-limits. In particular, A’ and A” contain the spectral projectors of any 
of their self-adjoint and unitary elements. 


Examples 5.3.1. 


L; 


2. 
3. 


If A C A’, all its elements commute with each other and A is an Abelian 
C* algebra, maximally Abelian if A= A’. 

The center of A is the Abelian C* algebra Z := AN A’. 

Of the commutative algebras of section 2.2.1, the C* algebra of continuous 
functions, C(&¥), is not maximally Abelian since it is properly contained 
within the C* algebra of essentially bounded functions, L% (&); the latter 
is instead maximally Abelian [293]. 


. If A = B(H), only multiples of the identity operator can commute with 


all bounded operators on H, that is B(H)’ = {1}, the trivial algebra. On 
the contrary, {1}’ = B(H)” = B(H). The same is true for the C* algebra 
of compact operators A = B,.(H), A’ = {1} for the identity is the only 
operator on H which commute with all finite-rank ones; however, unlike 
for B(H), B..(H) c B(H) = B,.(H)” in infinite dimension. 


. Consider the operator-valued matrix algebra My (A) consisting of N x N 


matrices with entries from a C* algebra A C B(H). Let Iyn@A C My(A) 
be the subalgebra whose elements have the form ly & X, X € A. The 


request that [ix ® X, De Ej; ® Xi] = ee Ei; Q X, Aiz] = 0 
for all X € A with X,; € B(H), implies (ly @ A)’ = My(J’). The 
bicommutant (1,@A)” can be identified by imposing that, for all X’ € A’ 
andl<k<QJN, 


N 
[Exe ®X", Y| =F (Er @ X' Xr - Ej 8 XxX’) =0, 


j=1 


where Y = Doai Eij ® Xi; € My(B(H)). This forces Y to be of the 
form Y = FÀ, Exe Q Xun with Xin € A". Finally, [Ei @ 1, Y] = 
Eij Q (Xj; — Xu) = 0 for all i,j = 1,2,..., N, yields X; = X for all i, 
whence (18 A)” = Iy 8 A”. 


. Given w € H, let HY C H be the closure of the linear span of vectors of the 


form X| yY), X € A, with A C B(H) a C* algebra, and by PF :H => HY 
the corresponding orthogonal projector. Since AHG C HY, it follows 


168 5 Quantum Mechanics of Finite Degrees of Freedom 
that PAX På = X Bh for all X € A, whence, by taking the adjoint 
X Ph = PIX Ph = (Pix! P$) = (Xt PHY = Po X for all X € A. 
Therefore, Ps € A’ and På € A” for all y € H. 


Definition 5.3.2. A vector y € H is cyclic for a C* algebra A C B(H) if 
HY = H, separating if X| yY) = 0 => X =0 for all X € A. 


Being cyclic and separating are related properties; indeed, suppose w € H 
to be cyclic for a C* algebra with identity A C B(H) and X’'| y) = 0 for 
X' € A. Then, 0 = AX'| y) = X'A| y), whence X’ = 0 for cyclicity 
of y € H amounts to A| y) being dense in H. Vice versa, suppose w to 
be separating for A’, but not cyclic for A; then, A’ 5 1 — Py # 0, but 
| W) =|) since we assumed 1 € A, which is a contradiction. 


Lemma 5.3.1. Let A C B(H) be a C* algebra with identity; then y € H is 
cyclic for A if and only if it is separating for A’. 


Cyclicity refers to the possibility that, by acting on some vectors with all 
the operators of a given algebra, one gets a dense subspace whose closure is 
the whole Hilbert space. This is the case with the vector |1) € L% (4) in 
the Koopman-von Neumann formulation of classical mechanics (see Exam- 
ple 2.1.1). By acting on | 1) with the simple functions, one gets a dense linear 
span, whose closure is the whole of L? (X ). The same is true using continuous 
functions f € C(X) or essentially bounded functions g € L% (&). 

Differently, if a vector w € H is not cyclic for A C B(H), then På projects 
onto a proper A-invariant subspace Há C H. The absence of proper invariant 
subspaces with respect to A is related to the triviality of the commutant A’. 


Definition 5.3.3 (Irreducible Algebras). A C* algebra A C B(H) is ir- 
reducible if only H if all YW € H are cyclic for it. 


Lemma 5.3.2. A C* algebra A C B(H) is irreducible if and only if A’ = {1}. 


Proof: If A’ = {1} then Pi = 1 for all y € H. If all ọ € H are cyclic for 
A and A’ Æ {1}, then, according to Remark 5.3.1, there exists a projection 
A’ > Q # 1. Therefore, if Q|w) = |Y), then QAI Y) = Alw); thus, the 


closure of A| 7) cannot equal H. 


Given the commutant and bicommutant of a C* algebra A C B(H), we 
can continue and consider the commutant of the bicommutant and so on. 
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Notice that, if A C B are two C* algebras acting on H, then B’ C A’. Thus, 
from AC A” it follows that A” C A’; however, A’ C (A’)” = A”, whence 


A € A” -_ Av —_ Av =., A Al” AY Ave oe (5.48) 


The closure properties of commutants and bicommutants discussed in 
Remark 5.3.1 are typical of 


Definition 5.3.4 (von Neumann Algebras). A C* algebra A C B(H) is 
called a von Neumann algebra if A = A". In the following, we shall employ the 
symbol M to denote von Neumann algebras, while keeping A for C* algebras 
which are not von Neumann algebras. von Neumann algebras M with center 
(see Example 5.3.1.2) Z = MOM = {A1} consisting of multiples of the 
identity are called factors. 


Theorem 5.3.1 (von Neumann Bicommutant Theorem). A C* al- 
gebra A C B(H) with identity 1, is a von Neumann algebra if and only if it 
is strongly and weakly closed. 


Proof: As the bicommutant A” is the commutant of the commutant it is 
strongly and weakly closed. Let A”, A° denote the weak and strong closures 
of A; the strong topology is finer than the weak one, thus AS C AY C A” 
(see Remark 5.1.1.3). Therefore, we need only show that if A = A* then 
A = A”; in other words, we have to prove that A is strongly dense in A”, 
namely that in any strong neighborhood U3(X"), X” € A”, of the form (5.7), 
there is an X € A. In order to do so, as a first step, note that, according 
to Example 5.3.1.6, given Y € H, P € A’ and P|) = |) for 1€ A; 
thus, PAA" |b) = A" PAID) = AM |y} Cc HY; this implies that for any 
€ > 0 and X” € A” there exists X € A such that ||(X — X”)|~)|| < e. 
The second step is to extend this result to generic strong neighborhoods; for 
this we use (5.15), Example 5.3.1.5 and the previous arguments with 1,, Q A, 
M,(A’), (In Q A)” = In @ A” and w replacing A, A’, A”, respectively 4. 
Then, for any € > 0, X” € A” and ý = Xi; |i) 2 |Y), there exists X € A 
such that 32", ||(X" — X): )l] Se. 


Examples 5.3.2. 


1. The previous proof shows that by considering the strong closure of a C* 
algebra A C B(H) one obtains the bicommutant A” C B(H). 

2. Since M C M! => Z = M, Abelian von Neumann algebras can be 
factors only if trivial. 

3. In the Koopman-von Neumann formalism, C(4) is a C* but not a von 
Neumann algebra; actually C(¥)” = L% (4) for the algebra of essentially 
bounded functions on L? (X ) is strongly closed by construction. 
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4. If M is an irreducible von Neumann Algebra (see Definition 5.3.3), then 
it is a factor, whereas the opposite is not true in general. However, if M 
contains a maximally Abelian von Neumann algebra M = N’ C M, then 
M’ CN and Z = M’, whence, if M is a factor it is also irreducible (see 
Lemma 5.3.2). 

5. B(H) is a von Neumann algebra since any bounded operator can be con- 
structed by closing the * algebra of finite-rank operators on H in either 
the weak or strong topology. Their uniform closure instead yields the C* 
algebra B,,(H) of compact operators which is not a von Neumann algebra 
for B..(H)” = B(H) > B(H), in infinite dimension. 

6. Let M C B(H) be a von Neumann algebra, E € M an orthogonal pro- 
jector and consider E M E; this is a von Neumann algebra acting on FH 
with commutant (E M EY = EM’ E(= EM' = M' E). Indeed, for all 
Xe Mand xX’ eM’, 


(EXE)(EX'E) = EXEX' = X'EXE = (EX'E)(EXE) , 


thus EM E C (EM’EY. On the other hand, if X = XE € (EM' EY 
it commutes with E and, for any X’ € M’, 


XX'=XEX'=XEX'E=EX'EX =X'X, 


whence (EM'EY C EME. 

7. Suppose M C B(H) is a factor von Neumann algebra and consider the 
algebra MUM’ consisting of operators of the form }), Xj X; with Xj; € 
M and X; € M’. Then, (MU M'Y = M’! A M" = M AM' = {Al}, 
whence (M U M’)” = B(H). 


5.3.1 States and GNS Representation 


The C* algebras so far considered had concrete representations by means of 
bounded operators on given Hilbert spaces H; more in general, the notion of 
C* algebra in Definition 5.2.1 can be formulated in purely algebraic terms. 
What one needs is the abstract setting at the beginning of Section 5.2 and an 
abstract definition of states, along the lines developed for classical systems 
in Section 2.2.1. 


Definition 5.3.5. A state on a C* algebra A is any positive, normalized 
linear map w : A =œ C; namely, w(YY) > 0 for all Y € A and w(1) = 1. 
States are also known as expectation functionals. 

A state w is pure on A if the only positive, not necessarily normalized, 
functionals p : A +> C such that u < w have the form u = Aw for some 
O<A<1. 

States w such that w(X'X) = 0 4> X =0, ¥ € A, are called faithful. 
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From positivity it follows that states are automatically continuous function- 
als; indeed, if w is a state on A, then 


(w(XTY))* =w(YTX), Jo( XY)? <w(XTX)W(VIY) . (5.49) 
In order to prove this, one chooses ÀA € C and consider 
0 < w((X+AY)1(X+AY)) = w( XTX) +rAw(XTV) 4AM w(VTX)4APW(VTY) . 


Then, the equality comes from setting A = 1,7, while the inequality from 
choosing À equal to the conjugate of the phase of w(XTY). 

The bilinear map A x A Ə (X,Y) w(XTY) would be a scalar product 
on A as a linear space, were it not for the fact that, in general, w(X'X) = 
0 even if X Æ 0. In order to circumvent this difficulty, one considers the 
set Z := {x EA: w(XtX) = o}. Because of (5.49), Z is a linear set 
and also close; therefore, one can consider the quotient A/Z consisting of 
the equivalence classes [X], := {x +I:I€ ae X € A. Since (5.49) 


gives w((X + L)I(Y + I2)) = w(XTY) for all 2 € Z, each class can be 
identified with a vector | YX ), Z corresponding to the null vector. It thus 
follows that (5.49) defines a true scalar product over the linear span of these 
vectors, (WY | WY) := w(XTY). Consequently, by closing the linear span with 
respect to the corresponding norm, one gets a Hilbert space Hw. 

Further, it is immediate to represent operators X € A as linear operators 
T.(X) acting multiplicatively on Hu, 


X> ToX),  Ta(X)|Y¥) = |Pxy) - (5.50) 


Since Z is a two-sided ideal in A, the null vector is mapped into the null 
vector and 7,,(X) is a well-defined linear operator on Hw; it is also bounded, 
for (5.49) and XTX < ||X||?11 imply 


la X) YY IP? = oY XXY) < | X[Po(VTY) = XP MPEP - 
Further, 7, is a so-called x morphism, that is 
Tol Xİ) =X), no(XY) =n (Xn (Y) VX,YeA. 


Therefore, 7,, represents A as a subalgebra of the bounded operators on Hw: 


Aw Tul A) C BHo)) 


Definition 5.3.6 (Representations). A «homomorphism (homomorphism 
for short) between two C* algebras Aı 2 is a linear map T : Ay œ A2 that 
preserves the algebraic relations and the adjoint operation: 


(Al) = 1(A1)t , (Ar Bo) = T(A1)n(B1) YA, Bi € A. 
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When a homomorphism is invertible, it is called an isomorphism, automor- 
phism if it maps A invertibly onto itself. 

If Ay = A and Az = B(H), then n gives a representation (n(A), H) as 
a C* (sub)algebra of bounded operators on a Hilbert space H. Two represen- 
tations of (tı 2(A), Hı 2) of A on two Hilbert spaces Hi 2 are equivalent if 
there exists an isometry U : Hy ++ Hy such that 7(A) = Ut m2 (A)U. 


According to Definition 5.3.2, the state | Wy ) is cyclic for m.,(A) on Hw; in 
fact, (X)| YP ) = | WY ) and the linear span of the vectors of the form | YẸ ), 
X € A, is dense in Huo, by construction. Also, the expectation associated with 
w takes the form 


w(X) = (WY |n (X) We), XeA. (5.51) 


We shall set | Qu) := | YY ); from Definition 5.3.2 it also follows that | 2. ) 
is separating for the commutant 7,,(A)’ C B(H.,) 

The previous approach is due to Gelfand, Naimark and Segal and is known 
as GNS construction. [64]. 


Definition 5.3.7. Given the GNS triplet (Hy, nw, Ru), Hu, Tu and R will 
be called GNS Hilbert space, GNS representation and GNS vector, respectively. 


Remarks 5.3.2. 


1. Any triplet (H,, my, 2,) with the GNS properties of (Hu, Tw, Rw) is uni- 
tarily equivalent to it. Namely, there exists an isometry U : H, + Hy 
such that U| Q,) = |.) and m,(X) = Utn,(X)U for all X € A. 
Indeed, because of (5.51) that holds for both representations, the map 
U:H, + H, defined by Ur, (X)| 2) = 7,,(X)| 2, ) is such that 


w(XTY) = (2, | m(X)' TY) | Qu) = (Q | T (X) UUT, (Y) |.) 
= (22, | T (X) T (Y) |2,) 
on the dense subsets 7,,(A)| 2) C Hu and m,(A)| 2.) C H,. Then, U 


extends to an isometry U : H,, + H_; furthermore, on the dense subset 
of m,(Y)| 2), Y € A, 


Ula (X)Um(Y)| Q) = Un, (X)a(Y)| Ru) = Ula (XY)| 2u) 
= U'Un,(XY)| 2) = mX Jr (Y )| 2), 


that is Unu (X)U = 7,(X) for all X € A. 

2. As a *-homomorphism, the GNS representation m, preserves the C* prop- 
erties of A. Therefore m, (A) is a C* algebra, as well as its commutant 
Tw(A)’. The latter is also a von Neumann algebra, this need not be true 
of Tu (A), but it is certainly so of the bicommutant 7,,(A)”, that is of the 
strong closure of 7.,(A) on Hu. If the center Zu := nu( A)” N m,(A) is 
trivial, that is it consists of the multiples of the identity only, then w is 
called a factor or primary state. 
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3. If w is a state on A and v is a linear positive functional on it, ma- 
jorized by w, v < w, then also v satisfies a Cauchy-Schwartz inequality as 
w in (5.49) 4, ju(XTY)|? < v(XTX)V(YIY) < w(XTX)u(YTY). Conse- 
quently, v defines a continuous sesquilinear form on He x Ha so that, from 
Example 5.2.4, v(XTY) = (Q|2,(X)'T’ 2.(Y) |Q), with T’ € BCH). 
Further, from 0 < v(X1X) < w(X'X), for all X € A, one deduces that 
0 < T’ < 1. Moreover, T’ € 7,,(A)’; indeed, 


v( XYZ) = (Q| noX) TTY yno (Z) 
= (Q| ToX) Tu (Y )T'Tu(Z) 


RY =v((YIX)'Z) 
2) = 


whence |T" , 7.,(Y)] = 0 for all Y € A since m,(A)| 2) is dense in Hu. 
4. From the previous result, it turns out that 7,,(A) is an irreducible C* 
algebra (see Definition 5.3.3) if and only if w is a pure state (see Def- 
inition 5.3.5). In fact, according to Lemma 5.3.2, 7,,(A) is irreducible 
if and only if m,(A)’ = {Al}. If m,(A)’ is trivial then v < w implies 
T’ = Al, hence w is pure. On the other hand, if 7,,(A)’! is not triv- 
ial, then there exists some 1 # X’ € 7,,(A)’, so that also X’ + (X^)! 
and its spectral projectors belong to the von Neumann algebra 7,,(A)’. 
Therefore, there must be at least one non-trivial projector P’ € 7,,(A)’; 
also, 1 — P’ = Q’ E€ m.,(A)’, so that one can decompose w into a convex 
combination w = Awp + (1 — A)wa’, where \ := ( Q | P’|Q) while 


Dp := Awp (X) := (Q| P’m(X)|2) (5.52) 
dg = (1 — Awg (X) = (21 Q'Tu(X) 12) (5.53) 


are positive, normalized linear functionals over A which are both ma- 
jorized by w but are not of the form Aw (compare the analogous argument 
in the proof of Proposition 2.3.8). 

5. The previous point is an example of the convex structure [20] of the space 
of states S(A) on a C* algebra A. In more formal terms [64]: given a C* 
algebra A with identity, the set S(A) of its states is compact in the w* 
topology generated by the semi-norms S(A) 3 w > Ly(w) = |w(X)|. 
Moreover, its extremal points are the pure states and S(A) is the w* 
closure of their convex hull. 


5.3.2 C* and von Neumann Abelian algebras 


Let A be an Abelian unital C* or von Neumann algebra. As discussed in 
Remarks 2.2.2.2,3, the algebra C(4) of the continuous functions over a com- 
pact topological space ¥ is a typical example of the first case, while the 
algebra L? (X) of the essentially bounded functions is an instance of the sec- 
ond case. In this section we shall show that these two cases do in fact exhaust 


“What matters in the derivation of (5.49) is positivity and not normalization. 
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all the possibilities: the main technique we shall use is the so-called Gelfand 
transform. 
All multiplicative functionals x : At C such that 


x(AB) = x(A)x(B), x(AT)=x(A* VA BEA 


are known as characters and their set will be denoted by V4. It turns out 
that characters are states on A; indeed, 


x(A) = x14) = x(I)x(A) = x) = 1, 
while, if A> A > 0 then A = Bİ B (see Remark 5.2.2), whence 
x(A) = x(Bİ B) = |x(B)|? > 0. 
Further, A—a, with a € C and A € J, is invertible if there exists B € A such 
that B(A—a) = 1; since, for any x € XA, x(B)(x(A) —a@) = 1, it follows that 


if A — a is invertible then x(A) # a for all x € X4. Therefore, the spectrum 
of A € A contains the values assumed on A by the characters on A: 


Sp(A) 2 {x(A) : x € Xa} - (5.54) 


Examples 5.3.3. 


1. Let A = C(&), then Xa consists of the evaluation functionals (Dirac 
deltas) b2(f) = f(x) for alla € X, f € C(x). 

2. Let A = Dy(C) the algebra of all diagonal matrices on C with respect 
to a given ONB {|i)}%,, namely A 3 A = DÀ} A; En, where {E;;} is 
the associated family of matrix units. Then, ¥4 consists of the maps 


X9(A) := Tr(AE35) = (j| Ald) = Ay - 


Indeed, AB = baer A,B, Ey; = vi A; BE; implies x; (AB) = 
A;B; = x;(A)x,;(B) for all A,B € A. Notice that the multiplicative 
property cannot be true of any pure state |Y )}( 4| € Myn (C); in fact, for 


|) = alt) + Bj) 


(Y| AB |y) = |a|? AB: + |8|? A;B; while 
(pAlb) (YIB |Y) = la|“ABi + |B|*AjB; + |al’l6|? (AB; + Ay Bi) - 


3. Characters behave as tracial states over A, namely y(AB) = y(BA). 
However, the only tracial state on My(C) is given by 


T(X) = TÈ X), sothat T(XY)=rT(YX), (5.55) 
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for all X,Y € My(C). It thus follows that there cannot be characters on 
Mn(C). Indeed, 


1 1 
2 
T(E) = (Eu) = go TE) Bu) = Hp 

therefore the tracial state can be multiplicative only if N = 1. In order to 
show that the only tracial state on My(C) is 7, let us assume that there 
exists another state w such that w(XY) = w(Y X) for all X,Y € My(C). 
Let X = E,; and Y = Exe; because of (5.12), it turns out that 


w (Ei; Exe) = jkw(Eie) = w(ExeEi;) = diwl Ex; ) . 


Thus, w( Fij) = 0 if i # j and w( Eu) = a for alli = 1,2,...,N so that 
w(1) = 1 = a = 1/N. In conclusion, w acts as T on a system of matrix 
units and must thus coincide with it. 


The set of characters is a subset of the unit ball of the topological dual 
of A (Xa C (A*)1); moreover, Xa coincides with the set of pure states over 
A. In fact, if w is a pure state on A, then in the GNS representation 7,,(A) 
is irreducible (m,(A)’ = {Al}), but then 7,,(A) = Aal for all A € A for 
Abelianness implies 7,,(A) C mu (AV. Then, w(A) = ( Ru |Tu(A) |Qu) = Aa 
and 

w(AB) = (2, | t(A)tu(B) | Qu) = AX\4AB = w(A)w(B) , 


whence w € XA. 

Let A* be endowed with the w*-topology (see Remark 5.1.1.5), then ¥4 
is a w*-closed subset of (A*) 1. In fact, if Xn E€ V4 w*-converges to x, that is 
if Xn(A) > x(A) for all A € A, x is linear and also multiplicative; indeed, 


Ix(AB) — x(A)x(B)| < |x(AB) — xn(AB)| 
+ IAI] [xn(B) = x(B)] + IBI] [xn (A) = x(4)| - 


Therefore, by choosing n large enough the left hand side of the inequality can 
be made arbitrarily small. 


Remark 5.3.3. Once the topological dual A* is equipped with the w*- 
topology, its unit ball (A*), is compact by the Banach-Alaoglu theorem [324]. 
As X4 is a closed subset, it is also compact [324]; further, since the space of 
states is Hausdorff, so is ¥4. One can thus consider the C* algebra C(4%,) of 
continuous functions over the set of characters and the corresponding prop- 
erties. For instance, in the proof of Theorem 5.3.2, we shall profit from a 
theorem of Stone and Weierstrass [259] which states that the norm-closure 
of any algebra of complex functions on a compact Haussdorf space ¥ that 
separates points and contains the identity coincides with C(%). 
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Definition 5.3.8 (Gelfand transform). The Gelfand transform is the map 
DT: At C(X,) from an Abelian C* algebra to the continuous functions over 
its characters, defined by 


ASAP TTA(X)=xX(A) Vee ta. 


Notice that I'[A](x) is automatically continuous on 4,4 equipped with 
the w* topology inherited from A*. In full generality, the following property 
holds [64, 300, 324] 


Theorem 5.3.2. Any Abelian unital C* algebra A is isomorphic to C(X,). 


Proof: The Gelfand transform is a x-homomorphism: linearity is evident, 
also I'[A'|(x) = x(At) = y(A)* = ([A](x))*. Moreover, 


T[AB](x) = x(AB) = x(A)x(B) = (TAILTB)) (x) - 
It also preserves the norm; in fact, 


PLAN? = sup [TIA] O9 = sup |x(A)? = IAI? 
XEXA XEXA 


for all A € A. The latter equality is a consequence of the fact that 4 
coincides with the set of pure states over A and that [64, 300], for any A € A 
one can always construct a pure state w such that w(A) = |A]. 

It thus follows that I’[A] = 0 only if A = 0. Furthermore, I[1] = 1 and, 
if x. A X2, then y1(A) = T[A] (x1) 4 x2(A) = L[A](v2) for some A € A. 
One says that [A] separates points of ¥4; thus, the theorem of Stone and 
Weierstrass (see Remark 5.3.3) applies to IA] so that [A] = C(¥4). 


Remark 5.3.4. [324] If A is a generic C* algebra and X one of its normal 
elements (X Xt = X'X), then one can consider the Abelian C* algebra 
A[X] generated by the norm closure of the -algebra of polynomials in the 
commuting operators 1l, X and Xİ. Let us consider the Gelfand transform 
T : A[X] C(X4)x}); to any function f € C(X4/x)) one associates a unique 
element f(X) := I~"[f] € A[X]. This is known as continuous functional 
calculus. Consider, for instance, the function f(z) := (y(X) — z)~+; then, by 
using a power series expansion and the isomorphic properties of I’ and its 
inverse, one obtains 


1 
Xz’ 


1 & 

r| =r, Wr- 
-z 

whenever z > ||X|| = supyex,/x, X(X)|. Thus, from Example 5.2.2.5 it fol- 

lows that the spectrum of a normal X € A coincides with the values assumed 

on X by the characters of X¥ajx]: Sp(X) = {x(X) : x € ajx) 
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Examples 5.3.4. 


1. If A is a finite-dimensional Abelian algebra, then ¥4 contains finitely 
many points (characters) 44 = {x;}%_,. The maps 6; : XA — C defined 
by 5, (xa) = 6;;, are continuous with respect to the discrete topology on 
Xa. By inverting the Gelfand isomorphism, the corresponding elements 
pi := '~"[d;] € A are orthogonal projections: 


pipe = ~*[6;6n] = dir T [64] = Sik De - 


These are minimal projections of A; namely, the only projections in A 
majorized by p; are the trivial one p = 0 and p; itself. Indeed, consider 
two projections q,p in a generic unital C* algebra and suppose q < p; 
then, writing p = q+ (p — q), Example 5.2.3.2 yields 


q>qpq=atap-qga>q for p—q>0. 


Therefore, q(p — q)qg = X' X = 0 with X = \/p— qq whence X = 0 and 
pq = q = qp. If p = pi € A and Ad q < pi, write Iq) = X; mi(q) ôi 
with 7;(q) € R4; it follows that 


a 


Tja] = api) = Tliall [pi] = 7:(q)ð: - 


Since I'[q] = I'[q]?, one concludes that I"[q] = 6; whence q = pj. 

2. Consider X > 0, and the function f(t) = vt, t > 0. Since the spec- 
trum of X is contained in [0,||X||] and thus also Y4;x) © [0, IXI]; 
from Remark 5.3.4 it thus follows that Y := f[X] = PHVA > 0 
and Y? = tjt] = X. We now show that the square-root of X is 
unique; let A 5 Z > 0 be such that Z? = X and let P,(t) be a 
sequence of polynomials on [0,||X||] converging uniformly to vt. Set 
Qn(t) := Pn(t?): limn—+oo Qn(t) = t uniformly on (0, || X'||]. Furthermore, 
from Remark 5.3.4 and Remark 5.2.1, 


X atx = Sp(X) = Sp(Z?) = (Sp(Z))? = {t : t € Sp(Z)} , 
whence 


Z= lm Qn(Z) = lim P,(Z?) = lim f(X) = VX. 
3. The Gelfand isomorphism maps the von Neumann algebra L?°(#’) into 
C(Y) where Y is a so-called extremely disconnected Hausdorff space whose 
open sets have open closures [324]. 


The preceding considerations can be extended to Abelian von Neumann 
algebras M C B(H) on a separable Hilbert space H [324]. Since M is also a C* 
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algebra, one considers the Gelfand isomorphism I": M > C(¥Xm); suppose 
w € H to be cyclic for M and define the linear functional F : C(¥m) > C, 


F(f):= (YIT A). 


This functional is positive since It is an isomorphism and preserves positiv- 
ity; then, Riesz representation theorem [258] (see (2.48)) ensures that there 
exists a positive Borel measure u on Xm such that 


(b| POLE] |b) = I du (2) fle) = alf) . 


The support of u is the whole of Xm, otherwise there would exist Y C Xm 
and a positive continuous f, non-zero on y, such that 


pf) =0 = (VT | VP Ae) . 


Since w is cyclic for M, it is separating for M’ and for M C M’; then 
THF] Y) =0 implies '~"[f] = 0 whence f = 0. For all X € M, 


A dy (x) |I[X] (a)? = (b| OLX TX] y) = XY )IP - 

M 

Then, one can construct a unitary operator U : H > K := L? (4m) by 
extending to the L?-closures of M| Y) and C (Xm) the linear operator defined 
on the latter spaces by U : X|q) — [|X]. It turns out that 


UXU (ry) =U XY =T[XY] = Trixi (r[Y)) , 


for all X € M, whence U X Ut is represented by a multiplication operator on 
C(X). This relation can be used to prove that U M Ut is a von Neumann 
subalgebra of B(K) and since the algebra of multiplication by continuous 
functions is weakly dense in the von Neumann algebra of multiplication op- 
erators by functions in L% (Xm) it follows that this latter is isomorphic to 
UMUİ. 

Even when a cyclic vector for the von Neumann algebra M does not exist, 
one can prove a similar result [324]. 


Theorem 5.3.3. Every Abelian von Neumann algebra M acting on a sepa- 
rable Hilbert space H is isomorphic to some LX (X), where X is a compact 
Hausdorff space and u is a finite, positive Borel measure on X supported by 
the whole of X. 


5.4 Quantum Systems with Finite Degrees of Freedom 


The simplest quantum systems are 2-level systems (the qubits of quantum 
information): their states and observables are 2 x 2 matrices from M2(C) 
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acting on the Hilbert space C?. Though simple, the 2 dimensional framework 
is sufficient to accommodate a variety of rather successful phenomenological 
descriptions as for spin 1/2 particles in magnetic contexts [248, 273, 280], 
for atoms whose ground and first excited states can be treated as isolated 
from the rest of the energy eigenvalues [87], for the polarization degree of 
freedom of photons [272, 312] and for the strangeness degree of freedom of 
neutral K mesons [30, 31]. Recently, even macroscopic systems in particular 
ultracold atoms [191] have been started to be studied as spin 1/2 particles; 
this is the case for the low-lying energy states of Bose-Einstein condensates 
in double well potentials and superconducting boxes near resonance [197, 
316]. The latest advances in the experimental manipulation of atomic systems 
have indeed provided concrete realizations of 2-level quantum systems and 
made them available for the actual verification of central issues of quantum 
information theory [6, 63]. 


The observables of 2-level systems are self-adjoint 2 x 2 matrices acting on 
the 2-dimensional Hilbert space C?; particularly important are the unitary 
and self-adjoint Pauli matrices 01,2,3 that satisfy the algebraic relations 


Ojak = Ôjk l2 + te jhe Oe , (5.56) 


where €jx¢ is the antisymmetric 3-tensor, and Iz denotes the 2 x 2 identity 
matrix. 

When normalized, ©, := o,,/ V2, they become an ONB with respect to 
the Hilbert-Schmidt scalar product (5.26), that is Tr(a.) = dy. Thus, it 
turns out that any X € M2(C) can be written as (see Example 5.2.5) 


3 


X= S\(TG,X)) Gp . (5.57) 
p=0 


It is customary to work within the representation of the eigenvectors of o3, 
|0) = @ and |1) = (2) (the states of a particle with spin 1/2 pointing 


down, respectively up along the z direction in space). Then, the Pauli matrices 
have the standard form 


(01 (0 =i (1 0 
SEN et Si 0> a ha =i 


so that 010) =|1), o|1) =|0) and o2|0) = —i| 1), o2|1) =i] 0). 

The action of a; on the standard ONB amounts to a spin flip; if 0 and 1 
were classical spin states encoding bits, then 0; would implement the NOT 
logical operation: 0 1, 1 — 0 or i œ i ® 1, where © denotes the binary 
addition (addition mod 2). The ONB associated with co, consists of 


4.) lol) 1 1 i= 
pje SN a e a: 
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Their ONB is unitarily related to the standard one by the Hadamard rotation 


il 
1 1 1 = ji 
b= (i 1 )=0k=07 vals) = ALO Yi]j). (6.58) 


A system eee of n spins 1 i? is described by the matrix algebra 
Mon(C) = (M2(C))®”"; denoting by ci, p = 0,1,2,3, the Pauli matrices of 
the i-th spin, the elements of Mən (C) are linear combinations of operators 
of the form Q;— g}, acting on (C?)®”. The so-called computational basis of 
quantum information consists of tensor products of eigenvectors of a3, 


|i) = iia in] = li) Dli) @---|in), iy € {0,1}, 
that are in one-to-one correspondence with bit-strings i”) € Q%”). 

One may interpret them as orthogonal configurations of quantum spins 
located at the integer sites 0 < £ < n of an infinite 1-dimensional lattice. 
Of course, unlike for classical spins, in this case linear combinations of these 
configurations are also possible physical states. Thus, a one dimensional array 
of n spins 1/2 provide a non-commutative counterpart to classical spin-chains 
of finite length n. Interestingly, their algebra Mə» (C) also describes n degrees 
of freedom satisfying Canonical Anticommutation Relations (CAR). 


Example 5.4.1 (Finite Spin Systems: CAR). From (5.56) it follows that 
different Pauli matrices anticommute, 


fo; ; ox} := Oj0n + Ono; = 20jn 1. 


Siig f W 1 _ %—to2 (0 0 
Set o} := —_ = (o a and o_ := o G a These 
matrices fulfil the following algebraic relations 
for, o_} =l; [o4 ; o_| = 03 (5.59) 
c=, les. o] KPT i les. o_| =-20_. (5.60) 


With |0), |1) the eigenvectors of 73, o_|1) = 0, vie o4|1) =|0). 
In the case of n-spin 1/2 systems, let Ty = o$ , o$., 1; denote the spin 


operators relative to the i-th spin. They are identified as elements of Mə» (C) 
by embedding them as 


M>2(C) 2 oi > ee ® oy ® Vist] 5 (5.61) 


where lj := 1; @ lj+1 @--+ My is the tensor product of as many identity 
matrices 1 € M2(C) as the sites in the subset [j,k]. Then, setting 
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i-l i—1 
ü= (Q o$) @o0t® Dest] > al i= (Q o$) 80} 8 lii+1,N] > 
j=l j=l 


one obtains operators that obey the CAR of n Fermionic degrees of freedom: 
fai, al} = aia} + al ai = O17 , far, aj} = faj j al} =0. (5.62) 


Further, the vector state |1)°” := |1) @|1) @---|1) consisting of n spins 
ee” 


n times 
all pointing down, behaves as the vacuum state for it is annihilated by all a;, 


while al, acting on it, creates the vector state with the i-th spin pointing up, 


a;|1)8"=0, al|1)@" =|1---1 O tek, 
ithsite 


There cannot be more than one Pee for each i as from z 62) (a!)? =0. 
Thus, products of the form |]; an Da, i, where i; = 0, 1 and (a e = Il, create 
the computational basis, 


m 
He Jee |e = | iriz in), 


where © denotes summation mod 2. Since 


[AB, C] = ABC — CAB = A(BC + CB) — (AC +CA)B 
= A{C, B} + {A, CYB, (5.63) 


the number operator 


i=l i=1 
. 1,+03 
= >, Ip i-1 & 2 i & Li+i,n] (5.64) 
i=1 


satisfies N [1 59" = 0 and 
[V,a]=-a;,  [N,al]=al, (5.65) 
whence N|i™) = ($= ty) |i"). Therefore, the basis vectors |é)) are 


occupation number states that is eigenstates of N ; they span the so-called 
n 


Fock space H = | vac) eB Hy, where | vac) = |1)®” is the vacuum state 


k=1 
and H; is the Hilbert space corresponding to k Fermi degrees of freedom. 
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As much as one can construct n Fermi creation and annihilation operators 
satisfying the CAR out of n spin 1/2 operators, so one can obtain the n spin 
algebra Mə» (C) out of the creation and annihilation operators of n Fermi 
degrees of freedom. Indeed, from (5.64) one derives of = 2ala; — 1 and 


= (Teaia: — 1)) üy; = (Jeata: — 1)) al : 
j=l j= 


These relations are known as Jordan-Wigner transformations [295]. They 
show that the algebra of n Fermi degrees of freedom is isomorphic to that 
of n spins 1/2, Mə» (C). It is important to notice that one Fermi degree of 
freedom correspond to a totally delocalized spin operator. 


Remark 5.4.1. [290, 291] The kinematical description of n Fermionic de- 
grees of freedom is abstractly provided by a set of n operators aj, al satisfying 
the CAR (5.62) together with the algebra Ar comprising all polynomials 
Pas, al) constructed with them. The Fock one is a concrete representation 


of the aj, al as annihilation and creation operators on a Hilbert space with a 
distinguished vector | vac), the vacuum state, such that aj| vac) = 0 for all 
he Zas 
Because of the CAR relations, in any representation 7 on a Hilbert space 
H, m(a;) and mal) are bounded operators with respect to uniform norm: 
m(ala;) + maja!) = 1 > n(ala;) = ||n(a;)|| <1. (5.66) 
Also, any two irreducible representations (71,2(Ar),Hj.2) of the CAR of 
finitely many Fermions are unitarily equivalent to the Fock one (see Defini- 
tion 5.3.6). Indeed, from the previous example, we know that both represen- 
tations are isomorphic to Mə» (C) for n Fermions; so, the positive operators 
n 


mi(N) = jai mi(a;)'m(a;) have discrete integer spectrum with an eigen- 


value 0. In fact, if 7;(N)|v;) = |v), then (5.65) implies 
=t] EN Ñ, a;l a”! 


a 


so that 
m (Rynt) |) = Arla) y) + [ra waa)" lie) 
= (A—n)r(ai)"| dp) . 


Thus the spectrum is discrete with 0 as its smallest eigenvalue; the cor- 
responding eigenvector | 7?) is annihilated by all 7;(a;) and is unique as 
implied by the assumed irreducibility of the representation 7;. In fact, the 
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linear span 7;(Ar)|~?) is dense in H; (see Definition 5.3.3); therefore, if 
milaj)| i) = 0 for all j = 1,2,...,n, then the same should hold for its com- 
ponent | ¢+) orthogonal to |Y? ); therefore, (d+ | ri(P(a;,a!) |?) = 0 for 
all polynomials in annihilation and creator operators, whence | ¢+) = 0 as 
it would be orthogonal to a dense subset in H;. The eigenvector |W?) is the 
vacuum for the Fock representation 7;; let U : Hı ++ Hz be such that 


Uyi) =Y), Um(P(a;,a}))| YP) = ma(P(aj,a}))| ¥2) - 


Since the scalar products (Y? | ;(P’)'a;(P”) |W? ), with P’ and P” arbitrary 
polynomials, are completely determined by the CAR relations, their values 
do not depend on the representation chosen; that is 


(Y1 | mi (Pm (P") Wht) = (42 | ma(P) T(P") | ) 
= (41 |m(P)TUT Um (P") 41) . 


on a dense set, whence U extends to an isometry from H; to Hy. 


Of course, not all quantum systems with finite degrees of freedom are finite 
level systems. A free quantum particle in one dimension or a one-dimensional 
quantum harmonic oscillator are systems with one degree of freedom, but 
they are described by means of the infinite dimensional Hilbert space of 
square-summable complex functions over R. In quantum information, these 
systems are sometimes referred to as continuous variable systems in contrast 
to spin-like systems whose variables (observables) are instead discrete (N x N 
matrices). 

For continuous variable systems the standard kinematics is more appro- 
priately given in terms of unitary groups of translations in position and mo- 
mentum. The algebraic relations between them are known as Canonical Com- 
mutation Relations (CCR). 

Consider a Hamiltonian classical system with f degrees of freedom and 
canonical coordinates R?/ > (q, p), q = (q1,42,-+-+%f)) P = (P1, P2;---s Pf): 
In standard quantization, one introduces unbounded, densely defined, self- 
adjoint position and momentum operators (qi, Pi) on H = Lig (RF) defined, 
in the so-called position representation, by 


(Yla) =pl), (Piv)(Q)=—t0,(a), YEH. (5.67) 


On a common dense domain, they satisfy the standard commutation relations 
(with h = 1) 


aa] =i, [&,%| aa]. (5.68) 


Unlike, Fermionic annihilation and creation operators, the operators ĝi 
and P; have continuous spectrum and cannot be bounded. Indeed, from (5.68), 
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[& Bt] = imap! = ARNAN > n, (5.69) 


for all integer n. Because of unboudedness, one introduces the one-parameter 
groups of unitary operators {U;(q)}qer and {Vi(p) ner, 


(U) = pata), (Viet) = Prya), 6.70 


where (q;); = ĝij q, in position representation, while 


(Vila) P) = Pyp), (ViPv)()=¥P-P), 67) 


in momentum representation, with (p;); = di; p- 

These semi-groups are continuous with respect to the strong-operator 
topology, whence, by Stone theorem [300], they are generated by self-adjoint 
operators g; and P; 


Ui(q) := exp (iq Pi), Vi(p) := exp (i p&i) . (5.72) 
By writing them as formal series, it can be checked that they implement 
translations in position, respectively momentum: 
URUN) =G +a, Vip) RUS) = P- p. (5.73) 
The CCR can thus be recast as 
Uila) Vi(p) = V;(@) Uila) iZi 


(5.74) 
Ui(q) Vi(p) =e'? Vi(p) Ui(a) 
Set q = (G1, Q2, Eka qf), p = (Pi, P2, aag PF), r= (@, P), and 
W (r) := ef @PHPS) — eir (217) | (5.75) 
O l; : : 
where - denotes the usual scalar product and X1 := 1 0 with ly is 
f 


the f x f identity matrix. These operators are known as Weyl operators; by 
the Campbell-Hausdorff formula, 


exp (A+ B) = exp (—-=|A, B]) exp (A) exp (B), (5.76) 


1 
2 
that holds when [A, B] is a multiple of the identity, they can be recast in 


the form ss a 
W(r) =e 29? ef TP eira, (5.77) 


The Weyl operators satisfy W(r)' = W(—r) and the composition law 


5.4 Quantum Systems with Finite Degrees of Freedom 185 
W(r1)W(r2) = 2°72) W(ri + ro), (5.78) 
where 
O(71,72) := q1 P2 — P12 =r- (Jer), If = e a , (5.79) 


is the symplectic form characteristic of the Weyl relations. It thus follows 
that the x algebra W generated by linear combinations and products of Weyl 
operators coincides with their linear span. 


Remark 5.4.2. Given the x algebra W, one looks for its closure with respect 
to a suitable topology; it turns out that the C* algebra that arises from 
the uniform norm is too small for physical purposes [300]. For instance, one 
would like that two Weyl operators W (r1,2) be close to each other when 
|r: — r2|| — 0; however, whenever rı Æ r2, 


|W (r1) — W(r2)|| = |1- WH (r1)W (r2) = 2, 


since unitary operators have norm 1. 

The fact that the translation groups {U;(q)}ger, {Vi(p) }per are strongly 
continuous, makes it a natural choice to consider the closure W of W with 
respect to the strong-operator topology. Similarly to the CAR , also for the 
CCR of finitely many degrees of freedom all irreducible (strongly continuous) 
representations are unitarily equivalent. In order to show this, one uses the 
so-called Weyl-transform of a function f € L}, (R7/) [300, 142], 


freWaw[(f):= fo f(r) W (=r). (5.80) 


The Weyl transform is such that WH (f) = W(f+), where ft(r) := f*(—r) 
and, by using (5.78), W (f1)W (J2) = W (fi x fe) where 


(fi x f2)(r) := faw filw) fo(r — w) e27wr) 
Let P := W(g) where g(r) = (Z7)? exp(—4|n|?), then, 


P=Pt, PW(r)P=ecalrl’ p, 


whence P is a projection for P Æ 0. Indeed, if W(f) = 0, choose y, € H 
such that Iy := (w| P|) 4 0; then, for all r € R”, 


0 = (Y| PW'(r)W(f)W(r)P |b) = [ew few) (| PW(w)P|¢) 


= Iy,¢ [ew f(w) eT allel? eiar w) 
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Since the integral is the Fourier transform of f(w) exp(—}||w||?), it follows 
that f(r) = 0 almost everywhere, which is not true of g(r). 

Let K C H denote the subspace projected out by P and consider an ONB 
{¢a} in K; since 


(W(r1)$a|W(r2)s) = (Pba|W'(r1)W(r2) |P oo) 
= (algo) rar) em alrs-rall | (5.81) 


the closures Ką of the linear spans of vectors of the form W(r)| ġa), r 
Rf, are mutually orthogonal. Each vector in Ka is cyclic for W e is 
then irreducibly represented on it. Further, K = H; in fact, the orthogonal 
complement Kı is also invariant under W. Restricting W onto K, yields 
another representation, W] , such that the maps f =œ W1 (f) := W(P) K1 
are injective. But this contradicts the fact that W1 (g) = PIK =0. 
Therefore, every strongly continuous representation of the CCR decom- 
poses into an orthogonal sum of irreducible representations. Further, the re- 


lation 
W(r)| Qa) = PW(r)P| ġa) =e allr ll? loa ) 


extends linearly to the whole of Ka, whence P acts as a multiple of the 

identity on each of the invariant subspaces K,. Consider any two irreducible 

representations Wa with their orthogonal projections Pa» = Wa,»(g) onto 

the cyclic vectors Pa| ¢a,b) = | Ga,» ) and their representation Hilbert spaces 

Ka». By linear extension, define the operator Uab : Ka + Ky such that 
UasWa(r)| ba) := We(r)|  ), for all r € R2/; from (5.81) 


(Wa(ri)¢a | Wa(r2)ba) = (Wo(11) G0 | Wo(r2) bo ) 
= (W (r1)ba |U] Uan |W (r2)ġa )- 


This relation extends to Ka, whence Ua» is an isometry such that 
Ul, Walr) Uas = W (r) , Vr e R”, 


so that the two irreducible representations Wa,» are unitarily equivalent. 


As we shall see in Chapter 7, for infinitely many degrees of freedom there 
are inequivalent irreducible representations of the CCR ; this is true also for 
finitely many degrees of freedom when the symplectic manifold is not R?/, 
rather a torus [300, 291] or when strong continuity is relaxed. This is the 
case for a discrete formulation of the Weyl relations that holds for discrete 
variable quantum systems and turns out to be extremely useful for quantizing 
hyperbolic dynamics of the kind studied in Example 2.1.3. 


Example 5.4.2 (Weyl Relations: Finite Dimension). 
Actions similar to translations in position and momentum can also be 
defined for finite level systems, that is on Hilbert space H = CY. Let 
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{|k)}N=) be an ONB , fix Qu,» € [0,1] and consider the following matri- 
ces Uy, Vn € My(C) 


N-1 N= 
Un =e% ie Seti ki kL, Vw =e¥'m X |k)(k-1], 
k=0 k=0 


together with the identification |j} = |jmodN). These operators are uni- 
tary and 
Un|£) =e tut py Yule) sew ilo 41), (5.82) 


Thus, setting n := (n1,n2) € Z?, Un and Vy satisfy the discrete Weyl 
relations 

UX VR = eW imn ymin , (5.83) 
Further, like in the continuous case, it is convenient to introduce the discrete 
Weyl operators 


Wy (n) := et 8 "2 ym yie, (5.84) 

They satisfy Wi (n) = W(-—n) and the composition law 
Wy(n)Wy(m) = è 9m) Wyn +m) , (5.85) 
with o(n, m) := nıma—nəmı. Since || [Un , Vy] || = 2| sin 4], letting N — oo 


one expects to recover the commutative structure of Example 2.1.3. 

In order to be compatible with a finite dimensional Hilbert space CY, 
powers as UÑ and VY must be proportional to the N x N identity matrix 
Iy; in particular, 


UN =e?" dy 5 VAY =e% Dy (5.86) 


Qu,v 


Different choices of a,,, label different irreducible representations Wy“; 
they play a role in the quantization of classical discrete maps as in Exam- 
ple 2.1.3 [98] (see Example 5.6.1). These representations cannot be equivalent, 
otherwise, for a, 4 a/,, there would exist an isometry T such that 


TË UNa T= e271 Ou = UN at = e2T tau , 


where Uy, denotes the operator Uy fulfilling a specific rule (5.86). 
When normalized, the discrete Weyl operators form an ONB in My (C). 
Indeed, using (5.84) and (5.82), it turns out that 


N-1 
Tr(Wiv(n )) = et F nına (£| Un Va? I£) 


T 


— et A (nina+2nı(au+)—2n2av)( g | l+ nə ) 


= n20 5y = 254 (aut) = N bn.o . (5.87) 
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This in turn yields 

Tr (WA (r)Wv(m)) = Nônm , (5.88) 
whence (see Example 5.2.5) 


x=—»> (Te (whin) x)) Wn(n) VXEMy(C), (5.89) 


2 
neZn, 


where Z4 := {n = (ni,n2):0< nj < N-1}. 


Returning to continuous variable systems, a particular vector state in 
H = L? (Rf) is given by the Gaussian function 


da=, G+), (690 


for alli = 1,2,..., f. Notice that in momentum representation g(p), obtained 
from g(q) by Fourier transform, has the same Gaussian form as the latter. For 
reasons which will become immediately clear we shall refer to the Gaussian 
state as to the vacuum and set | vac) := | g). 


Example 5.4.3 (CCR : Annihilation and Creation Operators). 
Given f canonical pairs (@;, Pi), using (5.68) one shows that the operators 


_ G+ iD; + Qipi 


aj > a= ; 5.91 
a a (5.91) 

satisfy the CCR that describe Bosonic degrees of freedom, 
lai, al | = Oi; , fai. a;| T [ai ; a} =0. (5.92) 


The Gaussian function | vac) plays thus the role of the vacuum for the CCR 
as it is annihilated by all a;, a;| vac) = 0. Since 


aja; (a})™ 


vac) = alai, (a )™] 


f ki 
the vectors |k) := | k1, k2,..., kf) = II a) i | vac) are such that 
i=1 t: 
ailk) = vVkilk- 1i), allk})= vkit lilk +l), (5.93) 


where k 1; := (ky,..., 4-1, ki £1, ki+1, ---, kf). They are the orthonormal 
eigenvectors of the number operator, 
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f 
N= X alai, Nik) = ($ klk) - (5.94) 


The occupation number states |k} span the Fock space for the f Bosonic 


f 
modes (degrees of freedom), H = | vac) ® Dp Hn, where Hn is the Hilbert 
n=1 
space of n modes. Unlike for Fermions, the number operator is unbounded 
and the Bosonic Fock space is infinite dimensional for such is the Hilbert 
space of each mode. 
By introducing the 2f-dimensional operator valued vectors 


A:= (ar... afia]... af) , At := Caer renee ; (5.95) 


the Weyl operators (5.75) can be rewritten as 


f djtiPj _ at 4P 

W(r) = e74 = Ile” vz “i Vs where (5.96) 
j=l 

za ee Pec, (5.97) 


while the CCR relations (5.78) become 


eZI A eZIA — 9-32) (23%s) o(ZÏ+Z3)A | 
1p 0 , 
where X3 = 0 i; and (5.97) yields 
Zi z (X3Z2) = —2i3(zF $ z2) = —2i o (r1, T2) . (5.98) 


Of particular interest are the so-called displacement operators 


2 
|z| 


D(z) = 67% -7a = e- gxa! e-z"0 (5.99) 


where z := {a3 € Cf and (5.76) has been used. Their action is as follows, 


1 oy, ; 
D(z)! aj D(z) = X` Paes laj] =a; +2;, J=1,2,...,n, (5.100) 
kj=0 7° 


where dži denotes the map d;,[] = [-z;a + 2% aj, -] applied kj times. 
Given z € Cf and the corresponding displacement operator D(z), us- 

ing (5.96) and (5.97), one finds that it corresponds to the Weyl operator 

W(r(z)) with r(z) = V2(—R(z), S(z)), whence, via (5.78), one computes 


D(z1) D(z2) = e7 241 (2:22) D(z + z2) . (5.101) 
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5.5 Quantum States 


Hilbert space vectors as those encountered in the previous sections are 
the simplest possible instances of quantum states: once it is known that 
a quantum system is in a physical state described by w € H, then the 
system observables X = Xt € B(H) have mean-values, or expectations, 
(X)y = (Y| X |v). With B(H) 5 Py := |Y )(y | the orthogonal projec- 
tor onto | ~), using the trace (5.19), one writes (X )y = Tr(Py X). 

One-dimensional projections Py, are known as pure states and are the most 
informative about the system they describe; they are quantum counterparts 
to the classical evaluation functionals ô+( f) = f(x) of section 2.2.1. Also in 
quantum mechanics, however, what is often practically achievable is not the 
specification of a precise vector state, but only that the system physical state 
corresponds to a projector P; occurring with a certain weight 0 < A; < 1 
within a statistical ensemble J of projectors such that X` jeg Aj = 1. In such 
a case, the state of the system is a mixed state, namely a mixture of pure 
states; relatively to them, observables have mean-values that are linear convex 
combinations of pure state mean-values: 


(X=) Ag (dy |X ly) =Te(oX), p= Dd Ay lve) (dy |. (5-102) 


jed jed 


As a linear convex combination (weighted sum) of projectors, p is a positive 
operator of trace 1, known as density matriz. 


Definition 5.5.1 (Density Matrices). Any positive trace-class operator 
p € Bı(H) with Trp = 1 describes a mized state; let p = $2; 73/75 )(75| 
be its spectral representation with 1 > r; > 0, »; r; =1. Then, p defines a 
positive, linear and normalized functional on B(H): 


B(H) X H+ wX) = TeX) = Yo r (451X183). (6103) 


The set of all density matrices over the Hilbert space H of a quantum system 
S will be denoted by S(S) or by B} (H) and called state-space. 

Its extremal points, those which cannot be decomposed into convex combi- 
nations of other states, are called pure states. 


Remark 5.5.1. The eigenvalue r; of p in (5.103) represents the probability 
to find the system in the state |r; ) once it is known that it is described by 
the density matrix p. However, a mixture as in (5.102) can correspond to a 
convex combination of non-orthogonal projectors P; = |; }( 4; |; this fact 
points to two crucial aspects that mark a substantial difference with respect 
to classical phase-space probability distributions: 1) a same p corresponds to 
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different mixtures and 2) the weights A; of the mixture are interpretable as 
probabilities if and only if 


= (Wr lole) = XCA ld; | ee)? > (5 | de) = Syn 


jEJ 


that is if and only if the vector states corresponding to a given physical 
mixture are the eigenvectors of the associated density matrix. 


Examples 5.5.1. 


1. The geometry of the state-space of two level systems can be simply visu- 
alized. Using (5.57), the density matrices p € M2(C) read 


ro s 1 1 / 1+p3 pi—ipa 
= =-(1+p-0)= -> 104 
p= (2 ai jaiteh to ATi), 6104 


with 0 < r < 1 and r(1 — r) > |s|? for p > 0. Thus, p € R? has length 
0 < |lel? = 1 — 4Det(p) <1. 


Thus, the density matrices of two-level systems are identified by the vec- 
tor p, the so-called Bloch vector, inside the 3-dimensional sphere. The 
pure states are uniquely associated with points on its surface, while or- 
thogonal states are connected by a diameter. 

2. By expanding the exponential operators in (5.99) as power series and 
acting on the vacuum state, the resulting pure state, 


lz? 


|z) := D(z)| vac) =e = eza | vac) = -lz i nE 


j=1 kj=0 
(8108) 
is a so-called coherent state [312], that is an eigenstate of the annihilation 
operators 
ailz) = zil z) rasli (5.106) 


It thus follows that the squared-moduli of the components of z are mean- 
occupation numbers: (z| N |z} = ae |z; |". Coherent states cannot be 
orthogonal to each other for there are uncountably many of them, indeed, 
from (5.101), one derives 

1*2?) 7 $z? — 2)? 


(z!|z?) = (0| Dt(—z1) D(z?) |0) = SC (5.107) 


Nevertheless, with z; = rj exp (i0;) and dz = []j_, rjdrj dð; , 
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1 lp; )( a | q | Pi +45 
=N pele w= a i 1 rjdrje "iriI tI x 
jJ=1 pj ,q5=0 vpjlal 
a dv, e075 (4 TT > |p; )(p;|=1. (5.108) 


j=1 pj=0 
Namely, coherent states form an overcomplete set in the Fock space HY), 


Let at represent the creation operator of a photon in a single mode; 


zr 
a coherent state |z) = e | zl? /2 oe Teg corresponds to a Poisson 
n=0 
em PY tel, 


distribution over the mode number states, |(n| z ) 
n! 


. Passing from annihilation and creation operators to position and momen- 


tum ones, by inverting (5.97) the complex parameters z € C/ correspond 
to points r = (q,p) € R?/ in phase-space, where q := /2R(z) and 

= /23(z). Coherent states are then characterized by gaussian lo- 
calization both in q and p; indeed, if V2zo = do + ipo, then from the 
discussion preceding equation (5.101), using (5.77), (5.70) and (5.90) one 
gets, in position representation, 


—lla—aoll?/2 
ipa (q— e 
(q| zo) = (a|W((—40, Po)) |vac) = e'o‘ GO (5.109) 


while, in momentum representation (see (5.71)), 


e-llp-poll?/2 


(p|zo) = e7 40° (P—-P0o/2) -77 


(5.110) 
The phase-space localization properties of coherent states make them use- 
ful tools for studying the quasi-classical behavior of quantum states [138]. 
In particular, given a density matrix p for a continuous variable system 
with f degrees of freedom, one can compare its statistical properties with 
those of the function R,(q,p) := (2|p|z) [300] which is positive, since 
p > 0, and normalized because of (5.108), thence a well-defined phase- 
space probability density. Viceversa, given a phase-space density R(q, p) 
one can naturally associate to it a density matrix, pr, which is diagonal 
with respect to the overcomplete set of coherent states: 


pr= [dz R2)|2)(2|= f SY Rey) e+ iyat iyl: 
per (2m)f 
(5.111) 
Most density matrices p admit a so-called P-representation [121] as above 
in terms of a function R(x, y) that is summable and normalized, but, in 
general, not positive and thus not a phase-space density. 


5.5 Quantum States 193 


The classical character of coherent states stands in sharp contrast with 
that of number eigenstates: this behavior shows up most evidently when 
photons interact with beam-splitters [123]. 


Beamsplitters 


A beam splitter is an optical device that is used to divide an incoming classical 
light beam of intensity I along a spatial direction 1 into a reflected beam of 
intensity R x I, with reflection coefficient R along an orthogonal direction 
2, and a transmitted beam of intensity T x J, with transmission coefficient 
T along the incoming direction 1. In absence of absorption and dissipation, 
R+T=1. 

Quantum mechanically, one associates photon modes to the two spatial 
directions; in an effective two-dimensional description, a generic single photon 
state |% ) incident upon the beam splitter is a superposition of single-photon 
basis states | 1), |2} describing photons impinging on the beam-splitter along 
the directions 1 and 2. 


a2 
BS 
at T2 bı 
NN eee een > 
N ti 
Tı : to 
2 


Fig. 5.1. Beam Splitter 


In absence of dissipation, the interaction with the beam splitter produces 
outgoing photon states according to the rules 


| 1") == U|1) =t1|1)+ rel2) , | 2") = U|2) = r1|1) +t2|2) , 
where r1,2,t1,2 E€ C are the reflection and transmission amplitudes along the 
directions 1,2. Therefore, a natural matrix U = © a appears which is 

1 t2 


unitary. In fact, by using creation operators al for the two modes, one writes 
|i) =a!| vac), where the vacuum | vac) is the state with no photons. Setting 
|1’) := bÍ | vac) and | 2’) = b}| vac), yields b = tial + reah, bf = rial + teal 
and the Hermitian conjugate linear relations. As the b” create and annihilate 
new photon states, they must comply with the CCR (5.92), whence 
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[br , Of] = |éx?-+|ral? = 1 = [br, Bf] = fta? rel? , [bi , 08] = tiritr3te =0. 


The whole physical process can thus be characterized by the matrix 


f= ty T2 = te’? rer?2 
~ Ari ta) \ rete tet } ? 
where r? + t? = 1 and (¢; + ¢2) — (Yı + Y2) = 7. For sake of simplicity, we 


shall set t = r = 1/2, pı = %2 = 0 and ¢; = ¢, so that ¢2 = T — ¢; in this 
case, the matrix U reads 


= ti 12 _ ES 1 e? 
U= a =F ee i ) . (5.112) 


It describes a so-called 50 : 50 beam-splitter that rotates by @ and m — ¢ 
the reflected and transmitted beams. The transformation a1,2 +> b1,2 can be 
unitarily implemented, namely we can explicitly construct the operator 0 
that sends |1), |2) into | 1’), | 2’). Since the beam-splitter does nothing to 
the vacuum, namely U| vac) = Ut|vac) = | vac), the action of J must be 
such that bı 2 = U aia Ut. Let us consider the operator 


U(z) a e7 01a} —2* alaz z= |z|e’ 


it is unitary and its action can be computed as for the displacement operators 
in (5.100). Namely 


a ie 1 x 
Pleat Oet =S a tal) , del] = [zamat — 2* alan, -]. 
k=0 
Since dz[a|] = zal, and dz|ab] = —z* al, the infinite sums can be explicitly 


computed, the result being 

(2)! = (cos zl) al — ef (sin |z|) af 

(z)t = e~* (sin|z|) al + (cos|z|) ab . 

Therefore, the matrix (5.112) corresponds to |z| = 7/4 and a = 7 + g; set 


U := O(-n/4 exp (7¢)). In terms of Ū it is now easy to check that an incoming 
photon along direction 1 emerges in a superposition of states, 


al +0lah gy = Le) +e%)2) 
a a 


Instead, for a coherent state |a} = D(a)| vac), a € C, one has 


|1’) = bi | vac) = Otal O| vac) = 


Ūla) = Ü D(a) O+ vac) = er aj Uta" Ü ar O" | vac) 
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This means that an incoming coherent state gets split into a transmitted co- 
herent state | J ), of intensity |a|?/2 and a reflected/phase-shifted coherent 
state |e"? 4 )2 of intensity |a|?/2, exactly as with classical light. 

On the other hand, a purely quantum effect results from a beam splitter 
acting on a state | 1112) consisting of two photons coming from the orthogonal 
directions 1,2. It is transformed into |1114} = b!b)| vac); explicitly, 


_ |21)+ | 22) 


The outgoing state thus consists in a superposition of states with both pho- 
tons moving along a same direction; then, photons will always be found to- 
gether either along direction 1, | 21 ), or along direction 2, | 22), with the same 
probability. On the contrary, no experiment can reveal one photon along di- 
rection 1 and the other photon along direction 2. This is because the ampli- 
tude for | 1,12) is the sum of the amplitudes of all processes leading to this 
state; in the present case these are reflections along either directions 1 and 
2 with amplitudes rjrg = —1/2 and transmissions along either directions 1 
and 2 with amplitudes t)t2 = 1/2. These processes interfere destructively. 

The same kind of effect appears in single photon experiments with Mach- 
Zender interferometers. 


fy ae ee aes 1 l 

|1114) = Ual U Ut al U +| vac) = AGI + eah)(—e~**al_ + ab)| vac) 
1 
2 


(e(a)? + atah — aha} +0(af)*)] vac) 


a. ti W 
rı 
BS» 
t2 
M — pd: 
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Fig. 5.2. Mach-Zender Interferometer 


In a configuration as in Figure 5.2, an incoming photon is either reflected 
or transmitted at a beam-splitter BS, with amplitudes rı and t1, reflected by 
perfectly reflecting mirrors M and then either reflected or transmitted with 
amplitudes rz and tz at a second beam-splitter BS2. The outgoing photons 
are then counted by detectors Dj; 2. The probability P; of a photon being 
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detected at Dı is determined by the amplitude at Dı which is the sum of those 
of the processes “reflection at B.S; + transmission at BS” and “transmission 
at BS, + reflection at BS”, that is Pi = |rit2 + tire|?. Analogously, the 
processes “transmission at BS; + transmission at BS” and “reflection at BS, 
+ reflection at BS” contribute to the detection probability at Də, Pz = 
\tyt2 + rıroļ?. One can visualize the entire process by means of a binary tree 
with one level for each beam-splitter. 


BS, 


Dz Dı Di Də 


Fig. 5.3. Mach-Zender Interferometer: Binary Tree 


If both beam-splitters act through a same operator U as before, then, 
taking into account the impinging directions, 


2 


1 ; sal 1 
P, = |5 (== +e) =sin?¢, Po=|5 (14%) = cos? ġ . 


An interference pattern thus emerges which depends on the phase-shift ¢ and 
can then be experimentally controlled. 


Uncertainty Relations 


Beside being necessary for the consistency of the statistical interpretation 
of quantum mechanics, the positivity of quantum states is, together with 
non-commutativity, at the origin of the Heisenberg uncertainty relations. 
Consider the CCR for f degrees of freedom; with the notation of (5.75), 
let p € B] (H) be a density matrix such that all first moments r; := Tr(p7i) 
and all second moments Tr(p7;7;) are finite. Then, the 2f x 2f real matrices 


Gr = [e(o are] EA = [m,e EEr) 


ij=1 


2f 


ij=1 


are both positive. In fact, 


2f 
(u| Clu) = D uju Tr(pF% — r) — 73) = THe Xt X) 20, 


ij=1 
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for all u € C?F, where X := ean uili — ri). Using commutators ([-, -]) and 
anti-commutators ({-, -}), one finds 

Bs A Ifa - Tio 

(Fi — ri)(fj — rj) = G ri), (Fj -r;)} + zA, 75] 


(= rii- ri) = G ri); G- ry)} = 5 [Fi A] 


With J, as in (5.79), the CCR (5.68) read Fi. *] = i(J,);;; it thus turns 
out that the correlation matrix 


pf E P + (one 


CF = |CF E ae eee c$ := si (ol =r); G- r;)} , 


ij 
beside being positive, must also satisfy 


CP + s(h(0 Fi. |) | = Chats S0. (5.113) 


Let f = 1 and choose p = |w)(w |; then, 
i/0 1 
orsi(% te 
(d| (Aq)? |v) (Yl iT, P e) 
(Yl {@, D} |p )/2- i/2 (4| (Ap)? |b) . 
where Ag := ¢—(w|@|wW) and Ap := p—(w|p|w). Therefore, (5.113) implies 


a z 2 
(| AGH) (w| Arey) > PMD TS 


These are the uncertainty relations for conjugate position and momentum. 
In terms of Bosonic annihilation and creation operators, using (5.91) 
and (5.95), the correlation matrix reads 


ve =U CPU, = s(t (of 4 = (Ai), At — (ah) by] , (5.114) 


i,j=1 


1 
where U; = —= E ) ® ly and (AFi = Tr(p 4¥). Notice that the 
matrices V?, 


2f 


V’ = [r (o(a; = (Aido) (4; ~ (4})5)| 


NI = 


ij=1 


and the transposed 


(We)? = = [te(o( At — (i)o) (Ai — (40,)] 


ij=1 
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are also positive. Since [Ai Al] = 6 if 1 < i < f, while [Ai Al] = —Õ;; if 
1+f<i<2f,and 


Ha, At) = A, At — 5 [4s Al] = ata; + aes a], 


it follows that 


1 
Ve >0, VP+55;>0, 3: -n (5.115) 
2 0 =l; 
The correlation matrix of a coherent state p = |z)(z| as in (5.105) is par- 
ticularly simple; indeed, by virtue of a;| z) = z;| z), it turns out that 


(z|ala,;|z) = zz; , (z| aa; |z) = xz; , (z| aja! |z) = fij + 2125 , 


1/1 0 
m ya ee f 
whence V’ = 5 ( 0 i) 


Gaussian States 


In classical probability theory, continuous probability distributions u on R” 
can be described in terms of their Fourier transforms or characteristic func- 
tions [157] 


Fylé) = | dula) €* 


In this way, the moments of the probability distribution can be obtained by 
differentiating F, (£) at € = 0. Similarly, let p be a state of a continuous vari- 
able system with f degrees of freedom equipped with a strongly continuous 
representation of the CCR in the form (5.96), then the characteristic function 
of p is given by 


Ff (r) := Tr(pW(r)) = Tr(pe*"4) =: FY (z), (5.116) 


where (5.96) and (5.97) have been used. 


Remark 5.5.2. The characteristic function FY (r) is the inverse of the Weyl- 
transform (5.80); indeed, 


where the convergence of the integral is understood with respect to the weak- 
operator topology on the representation Hilbert space. The easiest way to see 
this is to call X the right hand side of the previous equality and calculate its 
matrix elements in the position representation (5.67) using (5.70): 
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(a1 X loa) = | yap FEC) (al WU) laa) 


~ J -i FS (r) ôlq -qı + qz) iP Gata) 


r? 
— | = m a— 3P (q1 +42) 
-|r (qı q2, p) e £ : 2. 
By computing the trace in position representation, one gets 


FF (qı — q2, P) = Te(oW(a, - qo,P)) 


41-42 


= f az(x|ple- q+ ga) z-a) 


dp 
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whence, from the representation l —*__¢'P 4 — §(q) of the Dirac delta, 


Cr) 


dp ip- (q—x 
(a lXla)= fax | Eg (æl ple- q +42) = (ail alae) 


Taking derivatives of F] (r) at r = 0 with respect to the real variables 


qi, Pi, respectively of F Y (z) at z = 0 with respect to the complex variables zi, 
z7, one gets the expectation values of all products of position and momentum 


coordinates, respectively of annihilation and creation operators. 
For instance, the first moments arise as follows, 


BaF (2| = Telpa), A: FY l| _ = Teal) , 
while second moments can be extracted from 
1 
aatra = Tr(pajai) = Tle {Ai ; At) 
for l <as f,l+f js 2f, 
1 
a tati- , tT 
a= (oat!) = S(o(4. 41) 
for +f <4 < 97,1 <7 < f, 
F 1 
-4 -n(pad) = 20 (ofa) 
forl+f<i<2f,1+f<j<2f, 
dij i 1 i 
y= apad) =H (ofa) 


forl<i<f,1<j<f. 


O22, Fp (z) 


z=0 


(5.117) 


(5.118) 


(5.119) 


(5.120) 


(5.121) 
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The link with the correlation matrix (5.114) is apparent; indeed, the pre- 
vious moments arise from a gaussian characteristic function of the form 


FY (z) = eZ (A) — 3Z*.(V°Z) ; (5.122) 


where (A), := {Tr(p A;)} 2f; and the vectors Z, A are as in (5.97) and (5.95). 
Notice that (5.114) implies that the sesquilinear form 


(Z1, Z2) = Zi i (V° Z2) 


is symmetric and positive. 

Taking into account (5.91) and (5.97), one passes from the complex vector 
Z = (z*,—z) € C? to the vector r = (q,p) of canonical position and 
momentum coordinates by means of 

1 3 
Z=Uor, t= a(t, “em. 

0 d; 
l; 0 
FY (z) becomes the following Gaussian function of r € RS, 


Since ULU, = -1)) = -1 iF where U; is the matrix in (5.114), 


F°(r) = eir (UP) p) — 5r-(D1C? Sir) (5.123) 


where (P), := {Trom}. 

As in classical probability, the Gaussian form of the characteristic function 
is such that higher moments are determined by first and second moments. 
Obviously, not all quantum states have this property, if they do have it, they 
are called Gaussian states. 


Example 5.5.2. Coherent states p = |u )( u| = D(u)|0)(0|Dt(u), u € Cf 
are Gaussian; indeed, using (5.100), 


H ZA Z*-.(A+U 
Fp (z) = (0| D' (u)e D(u)|0) = (Ole (A+ ) J0) 
= 2" (Q/e2" 410) = e7 U-la? U = (u,u*) € C”. 


The first moments are u = (u |a |u), u* = (u| at |u), the correlation matrix 
ly 0 
Pa 
n 27\ 0. My 
From its characteristic function, one can easily determine whether a given 
Gaussian state p is pure; indeed, by using Remark 5.5.2, one computes 
dr, dr 
Tr(p*) = — 


aye PE (r1) FE (r2) Te(W(-r1)W(-r2)) 


Z dr C r C —r)= dr —r(CPr) _ 1 
= | Gay Fo (e) Fp (r) | oar V4 Det(C?) ` 


Therefore, p is pure if and only if Det(C’) = 47f. 
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A part from their first moments which can always be set equal to 0 by a 
suitable shift operated by a displacement operator D(z) in (5.99), Gaussian 
states are completely determined by their correlation matrix. An interesting 
question is the following one: given a Gaussian function 


F(z) = e7 M eat 0A). (5.124) 


with M € C?f an assigned complex vector and V an assigned (2f) x (2f) 
positive matrix such that the associated sesquilinear form is symmetric, 


Z* - (VZ) = Zš- (VZ:), (5.125) 


is F(z) the characteristic function of Gaussian state p with correlation matrix 
V and first moments given by the components of M? 

The answer is that it is so if and only if V satisfies the conditions (5.115). 
While necessity descends from the uncertainty relations, sufficiency comes 
instead from the following general result [143]. 


Proposition 5.5.1. A function Rf > r =œ FC(r) (Cf > z =œ FY(z)) 
is the characteristic function of a quantum state p of f degrees of freedom 
satisfying the COR if and only if 1) FO(0) = 1 (FY (0) = 1), 2) FF (r) 
(FY (z)) is continuous at r = 0 (z = 0) and 3) for any n-tuple {r;}"_,, 
ri € R”, Qzi} 1) the nxn matriz F? (FV ) with entries (see (5.98)) 


Fa = e720 (rora) F? (r; — ri) (A= = e371 (2:2 3) FO (zj — zi i) 
is positive definite. 
We postpone the proof of the proposition and instead show that if the 
positive (2f) x (2f) matrix V in (5.124) satisfies V + 433 > 0, then there is 


a (Gaussian) state p with F(z) as characteristic function. 
We just need to consider condition 3) and prove that 


Ss e(Z3-Zi)-M e-}(Z}-Z})-(V(Z;-Z:)) gh Bi(BZs) > 0 , 


j=l 


for all choices of n complex vectors z; € Cf in Z; = (z*,—z;). Because 
of (5.125), we shall thus show that 


. *M-1zZ*. l 
J wj wj ef (V+2¥3)Z;) >0, where w;:= eZ M-21 (VZ) | 


4,g=1 


Since V + 305 is positive, the same is true of the n x n Hermitian, positive 
definite matrix A := [A;j], with entries Aj; := Z7-((V+$3)Z;). Then, con- 
sider the spectral decomposition of A with eigenvalues ag > 0, L = 1,2,...,n, 
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so that Ajj = Wa ape pe; where wy»; is the i-th component of the ¢th 
eigenvector of A; then, 


n . g oo 1 t k 
2 u a > [e 
k=0 


i, j=1 £1 ,00,...€h=1j=1 


>0. 


n k 2 
dw [[ ves) 2 


i=1 r=1 


Proof of Proposition 5.5.1 If FY (z) = FY (z) in (5.116), then condition 
1) in the statement of the proposition is satisfied because Tr(p) = 1, while 
condition 2) is fulfilled since we assumed a strongly-continuous representation 
of the CCR and p is a trace-class operator, so that 


FY (z)-1| < eu 


j 


(rile? A- ijr) 


? 


where rj and |r; ) are eigenvalues and eigenvectors of p. As regards condition 
3), observe that using (5.116) and (5.78), it turns out that 


Sy =Tr(pxt x)>0, 
g= 


where X := X`;_] ui exp(Z? - A). 

The sufficiency of conditions 1), 2) and 3) is shown by using them to 
construct a strongly continuous representation of the CCR by Weyl operators 
W (r) on a Hilbert space H and a density matrix p on H such that F°(r) is 
of the form (5.116) [143]. One starts by defining the operators 


(Wo(rw) (w) =e 2° y(wer), reR?, (5.126) 
on the functions on R?/; these operators satisfy the CCR (5.78): 
(Wo(r1) Wo(r2)) (w) = 67270172 (Wo(rs + r2)8) (w) . 


Then, one considers the linear span Ko of all functions on R?/ of the form 
w) = Soe exp(—50(rre, w)) ; 


for all finite n € N, and defines on it the sesquilinear form 


nı nə 


(Yo, | Wo, ) F =) (c) )* grr pee rere) ; 


i=l j=l 


where z;,; are related to r;; via (5.97). Because of the positive semi- 
definiteness of F°, (Wo | Wo ) p > 0; thus, in analogy to the GNS construction, 
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one takes the quotient of Ko by the kernel consisting of those Wo such that 
(Wo |%c)r = 0 and then its completion with respect to the scalar product 
defined on the quotient by (-|-). This gives a Hilbert space K containing 
the constant function I(r) = 1 on RF for (11| 1) = 1; similarly to the GNS 
vector, | 1) is cyclic for the family of operators Wo(r) since 


n 


= X Ck exp(-Żo(re, w)) = 5 Ck (Wore) 1) (w), 
k=1 


k=1 


and I 
(1|Wo(r)|1)F = (D| e727) )p = F(z) ; (5.127) 


Further, (5.126) yields 


(Wolr Yo, )(w) -Y ae żo( TkT) g —so(r;,+7r,w) 
whence 
(Wo(r)¥e, | Wo(r) Por )w = X (a) h etr Cin) etoi) 
ij 


i 1 2 
x FO (r? _ ri) eo ars +r,r;+r) 


The operators Wo(r) can thus be extended to unitary operators on K where 
they provide a representation of the CCR . If the latter is strongly continu- 
ous, then, from Remark 5.4.2, it reduces to an orthogonal sum of unitarily 
equivalent irreducible representations. Namely, there exists an irreducible rep- 
resentation of the CCR on a Hilbert space H by Weyl operators W(r) and 
an isometry U : K > H:= @, H such that 


Wo(r) =UtW(r)U, Wr) := Pwr) 


Also, U| 1) = @,, | Yn) is a normalized vector in H, namely yea? = 1, 
so that p := 5°, A, Pn is a density matrix on H, where An := ||Yn]|? and 
Pa := | Yn} Yn | with Yn := Un/||qn||. Now, from (5.127) it follows that 


Tr(pW(r)) = Eyn IW) lbn) = (UD Wr) U) 


n 


= (1| Wo(r) |1) = F(z) , 


which concludes the proof. In order to show that the representation of the 
CCR by the Weyl operators Wo(r) on K is strongly continuous, we shall first 
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show that the condition 1), 2) and 3) ensure uniform continuity of FO (r) at 
any r. Indeed, choosing vectors rı = 0, ro = wand r3 = w, it turns out that 


1 FO (u) FO (w) 
u) 1 FO (w —u)e 27) 


FE = Fo(- . 
FC(—w) F°(u— w)e27%) 


its positivity implies FO (—u) = FC(u)*, |F] (u)| < 1 and 
2 
FC(u) — F° (w)| <1- | F° (u @ w)| 
— 2K {(F°)*(u)F°(w) [1 — F? (u — w) ere) } 
<4 |1 — FO (u — w) eo) : 


Then, the strong continuity of the representation is a consequence of the fact 
that, for all Wc € Ko, the contribution 


(Wo | Wo(r) |Wo)r = G Cj e7 B(9(ra.7)te(ri rs +r)) Fo (r; +r-ri) 
i,j 


goes to (Wo |o )r when r — 0 in the equality 


(Wor) — DI Zo) = 2((¥e |e) — KWo | Wo(r) We yr) 


and that this result can be extended to the whole of K. 


Examples 5.5.3 (Two-Mode Gaussian States). 


1. Consider two bosonic modes (T = (G1, G@, P1, P2)) in a state p with Gaus- 
sian characteristic function, 


— irp. PSr 
Fy (r) = e7 37 (210° Fir) , Rt>r= (q1, q2, P1; P2) - (5.128) 


With respect to (5.123) (7), = 0, a case which can always be attained by 
suitably translating p. This is more easily ascertained in terms of creation 
and annihilation operators as in (5.122); indeed, if p is such that 


(A) > = {Tr(p Ai) J= = U = (u, u”) # 0, 


consider p := D'(u)pD(u), with D(a) as in (5.99) (see also Exam- 
ple 5.5.2). Using (5.100) and (5.116), 


FY (z) = Tr (p D(u)e2"'4 D'(u)) =e7Z"U FY (z) 


=e Z" U Z“ U- 4Z*(V°Z) _ e7 }4Z*(V°Z) 
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It is convenient to rearrange F as R= M7, M = M~! = MT, 


Tı 1 000 Tı 
>s || 1/0010 @ 
R=(21=[5 10 o0}la (5.129) 
P2 0 0 0 1 i) 
— m 
M 
As a consequence, the CCR (5.68) now read 
0 1 0 0 
E E @=MbM=|— ° h : , (5.130) 
0 0 -1 0 
and the characteristic function (5.128) becomes 
FO(r) =: G(R) = e7 2$ ($1 VLR) | (5.131) 
0 0 0 1 F 
n 0 0 1 0 1 a s 
where X1 := 0100 and V = rah] More 


1 0 0 0 
explicitly, a same argument as the one that led to (5.113) shows that C 
is a positive real 4 x 4 of the form 


y= (é 4 (5.132) 
osan npa pm hui) 69 
95 B= haD A) Eo 

o= (Tene) none) (5.185) 


By multiplying (5.113) on both sides by M, the necessary and sufficient 
conditions for V to be the correlation matrix of a gaussian state are 
V 522 0. (5.136) 


. Every 2 x 2 real matrix S of determinant 1 such that SJ. ST = Js, 

0 1 
—1 0 
of freedom, can be used to define new one-mode canonical operators, 


A A yi 
T = SP, F= (2): P := & [F Rj] = i(J2)ij. This fact allows for 


where Jo = is the symplectic matrix (2.6) for one degree 
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a greatly simplification of the basic structure of the correlation matrix 
VY [278, 115]. As a first step, consider a positive, real symmetric 2 x 2 
matrix X with a := ,/det(X) and define S := \/X/a; it turns out that 
X = ST € n S. Furthermore, since VX is symmetric and Jo anti- 
symmetric, VX J2 VX is antisymmetric and thus proportional to J2. Its 
determinant is a whence VX Jz VX = a Jz and S Ja ST = Jo °. As a sec- 
ond step, use this result and let S4, g effect the symplectic diagonalization 
of the positive, real matrices A, B in VY, then 


yaa 0\ fal C St 0 
0 SB CT Bi G S) ` 
As a third and last step, notice that the real matrix Č can be written as 
C=U k : ) VT, where c12 are its singular values (see (5.16)) and 
2 


U,V are two orthogonal matrices. If their determinant is 1 then they also 
preserves J2, otherwise set U := Uo3, V := Vaz and 


Yi 0 g Ci 0 a 
0 y2 e 3 0 C2 35 


where now 71,2 need not be both positive. Therefore, any two-mode cor- 
relation matrix can be written as [278, 103] 


a 0 jy 0 
ta 0 0 a 0 Ur o0 
v= (% i) aoe: k op) + 6199 
0 0 8 
Vo 


by means of matrices U4 g such that Uy p J2 UAB = Jo. In conclusion, 
by suitably changing canonical coordinates, one can always reduce V to 


the standard form Vy = (a ea where Ag =: allo, Bo := plo 


and Co := G i" ). Then, for Yo, by imposing the positivity of the 
2 


principal minors of Vo + $2, the condition (5.136) amounts to 


I 
1 +4,>h 4+ 12 + 271s, (5.138) 


where l = a? = Det(Ao), Ip = B? = Det(Bo), Ís =y = Det(Co) and 


5The above argument is the simplest formulation of a more general theorem of 
Williamson on the symplectic diagonalization of positive, real 2f x 2f matrices [115] 
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I, = (aß — 77) (aß — 72) = det(Vo) . 


As the determinants J; are invariant under transformations as those lead- 
ing from V to Yo, that is I; = Det(A), I2 = Det(B), I3 = Det(C) and 
I, = Det(V), inequality (5.138) is necessary and sufficient to ensure that 
a positive, real 4 x 4 matrix V as in (5.132)— (5.135) be the correlation 
matrix of a two-mode Gaussian state. 

3. Consider a two-mode Gaussian state p as in (5.111); a necessary and 
sufficient condition such that R,(q1, q2; p1, p2) > 0 is that the correlation 


matrix V satisfy 
lly 
+ — >00. 
y so 0 
Indeed, using the argument at the end of Example 5.4.3, from the CCR 
relations (5.78) and (5.107) the characteristic function of pr results 


da dy 
pi (27)? E 
x (vac| W(V2(æ,—y)) W (q, p) W(v2(—a, y)) |vac) 

dæ dy 


= J, z Blew) Gur (vacl W(a,p) vac) 
R4 


=i 2 2 da dy i : £ 
— e- }(lal?+lpl®) f. Tr ewe V3(ay+pæ), (5.139) 


Egg) = R(x, y) x 


Because of the argument developed in the first one of the above ex- 
amples, we can assume pr to be a Gaussian function with (7),, = 0; 
thus, by Fourier transform and using (5.131) the result follows from 
IRI? = |Ir||? = llall? + |Ipl|? and 


ILRI? 


—iv2 u- (XıM R) G(R)e Zz 


dR 
R(u) = a ae e 


=| dR oe iv2u-(21MR) .—3R(5E1(V— 314) 21 R) 
2 
R4 7 


where R := (41,P1;92;P2) € Rt, Ma(C) > X, = é a and 
2 
M,(C) > M is the matrix in (5.129). 


Remark 5.5.3. Matrices as those in the second example above form the so- 
called symplectic group for f = 1 degrees of freedom [265]; the symplectic 
group for f > 1 degrees of freedom consists of real matrices S € My(R) of de- 
terminant 1 such that preserve the symplectic matrix J p, that is S Jf ST =] 2 
Setting as before 7’ := SF, the CCR are respected, namely [F], T3] = (Sf )aj- 
Therefore, because of the unitary equivalence of the CCR representations 
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(see Remark 5.4.2), there exists a unitary operator U(S) on the representa- 
tion Hilbert space H = La, (R‘) such that 7’ = U'(S)FU(S). From (5.75), 
the Weyl operators transform as 


Ut(S) W(r) U(S) =e 1S) — ef (57r) (21°) — Ww(STr), (5.140) 
. A C f Z D C ; 
where, if S = G B with A,B,C € M;(R), then S = & 3 while 
ot DT CT 
A oT 
the transposed S* equals cC at): 


Let p € B7 (H) be a density matrix for the f Bosonic degrees of freedom 
and consider the state ps := U(S) pU'(S) obtained by operating the sym- 
plectic transformation of the canonical operators; because of (5.140), their 
characteristic functions (5.116) of p and pg are related by E, (r) = E,(S?r). 
With r = (q,p),u = (x,y) € R*/, (5.139) generalizes to 


Bale) = etr f du R (u) eY) , 


pes (27)? 
O; l; l 
where X1 := 1; O)’ whence the corresponding functions Rps (u) and 
Rp(u) in the P-representation (5.111) are related by 
-r2/4 f du p ivu (51r) — 
g Í. (Qr)f ps (tu) e 
= eT f dU R (S-tu) piv2u (Zr) T 
e Í (27)! ail u)e k ( ' ) 


5.5.1 States in the Algebraic Approach 


As we have seen in Example 5.2.4 and Remark 5.2.3, expectations as in Defi- 
nition 5.5.1 provide semi-norms that equip B(H) with a w* topology which is 
equivalent to the o-weak topology. On the other hand, in Section 5.3.1, states 
have been defined as positive, normalized linear functionals on C* algebras 
which are continuous with respect to the uniform topology of B(H). Since 
this topology is finer and thus has more open subsets than the o-weak one, 
in general expectations need not be also o-weak continuous. The following 
result [64] characterizes the space of density matrices within the more general 
space of states on B(H). 


Proposition 5.5.2. If M C B(H) is a von Neumann algebra with identity, 
all o-weakly continuous expectations on M have the form M 3 X > Tr(p X), 
with p € By (H) a density matriz. 
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Proof: From Example 5.2.4, any o-weak functional F : M > C takes the 
form F(X) = $, (Yn | X [Øn ) where {Yn} and {¢,,} are sequences of vectors 
in H such that 55, |||]? < œ and X, lloll? < œ. 

Set |) = Dalyn), | o)= un | n ) and consider the following represen- 
tation of B(H) on their Hilbert space H, (X)|) = @,,(X|n)), X € B(H). 
Then, F(X) = (| 7(X) |$). If F is positive and M > X > 0, then 


i ads Be vee 
P(X) = 7((b+ 4] m(X) |b + 5) - (8 - 6] a(X) |b - 9) 
Joan et a? 
<a tolr(X) b+). 
By considering the GNS construction based on the vector state |Y+ġ) (once 


normalized), Remark 5.3.2.3 implies the existence of 0 < T’ = (9’)''9’ < 1/4 
in the commutant of 7(M). Since S” maps H into itself, 


F(X) = (@+4|T'x(X) |b +b) = (SG +4) lX) E+) 
=X (xn|X|Xn}, vX eM. 


Set p := D0, | Xn)( Xn |; this operator is positive. If F(1) = 1, then Tr(p) = 1, 
whence p € B (H) and F(X) = Tr(p X) for all X € M. 


Remark 5.5.4. [64] As functionals, density matrices are normal as their o- 
weak continuity is equivalent to the property of normal linear maps outlined 
in Remark 5.2.7. 


Because of the convexity of the space of states (see Remark 5.3.2.5), one 
has that 


Proposition 5.5.1. The space of states S(S) of a quantum system S is con- 
vex and a same density matrix can in general be decomposed into infinitely 
many different convex combinations of other density matrices, unless it is a 
pure state which is thus extremal in S(S). 


Proof: Take any set of 0 < X; € B(H), j € J, such that 7. ; X; = 1; as 
p = 0, in terms of its spectral decomposition p = `} rx| Ye) (Vx |, its unique 
(positive) square-root is given by VP = X} p Vrk| bx )( Gx |. Then, 


Xj 
p= Am, p P, yeToX). (5.142) 
jEJ 
Thus the same density matrix p describes mixtures whose components are 


described by density matrices pj; in turn, these can also be decomposed unless 
they are projectors, as, in this case, p = | ~)(w| implies 
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VeX; V0 = (b| X50) 1d) ¥ 


so that p; = p for all choices of X; > 0. 


Example 5.5.4. [156] Consider two generic decompositions of a same den- 
sity matrix p € S(S) into non-orthogonal one-dimensional projectors, 


P Q 
p= > Velde) t]: P= >, velba) bal VP» 


p=1 q=1 


| wp ){ wp | l2q)( zal 


with pea lwp ){ Wp | = 1 and D 1|¢q)(¢q| = 1. By means of the spectral 


representation p = DA rj|rj}(rj |, setting |v; ) := ./rj| rj), one gets 


The Px N matrix W : CN +> CP with entries W,; := (Yp | r; } and the Qx N 
matrix Z : CN + CS with entries Z,; := (Qq | rj) are such that WIW = 1y 
and ZÎZ = 1y. Then | ve) = Ep- Wpe| wp) and |2q) = Ei- Vpal wp), 
where V = WZt : CS +} CP. Thus, any two decompositions of p into 
projections are related by a P x Q matrix V such that VV' = WW? and 
ViV = ZZi; therefore, if P = Q = rank(p), then W, Z and hence V are 
unitary matrices on the support of p. Also, if P = rank(p), but Q is arbitrary, 
then V is an isometry such that VIV = 1y. 


In Section 5.3.1, states on C* algebras have been used to construct Hilbert 
space representations; in the present setting, a representation on a concrete 
Hilbert space H is a priori given, it is nevertheless instructive to consider pure 
and mixed states of a finite dimensional quantum system S. In such cases, 
the GNS representation amounts to what in quantum information is known 
as mixed state purification. 

Let p € My(C) be a density matrix and consider its spectral representa- 
tion p = ee r;|7r;)( 7; |, some eigenvalues possibly being equal to zero. To 
p one associates the state vector | \/p) € CY @ CN given by 


N 
=} vlr) Slr). (5.143) 
j=l 
Given X € “o let it be represented by ~ = X Q 1y on C” oc’, 


X @ In| vP) -EVEI DSlr)= SA (re|X |r5) [Te ) ®| 75) 


j,k=1 


= Er S (5.144) 
whence (VP|X 8 In |V) = (vP| XVP) = Tr(pX). 
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Pure States 


If p = |¥)(v|, Y € CN, then | yP) := |v) 8 |Y) € CX @CN and 
m(X)|/p) = X|v) 9 |Y), for all X € My(C), whence the GNS Hilbert 
space is (isomorphic to) C. Indeed, by taking the quotient of My(C) with 
respect to the set 


i {xe My(C) : (Y| XtX |) =o} 


the equivalence classes |Y} are identified with operators of the form 
X| y) y|. By varying X, the GNS Hilbert space H, is generated by vec- 
tors |¢)(w| for all ¢ € CY and is thus isomorphic to C^. It follows that the 
GNS representation 7,(M@y(C)) is unitarily equivalent to My(C) and thus 
irreducible in agreement with Remark 5.3.2. 


Faithful Density Matrices 


At the opposite end with respect to pure states, let us consider a den- 
sity matrix p E€ My(C) with eigenvalues r; all different from zero, so that 
Tr(p Xt X) = 0 4> X =0. According to Definition 5.3.5, p is faithful. 

Matrices X € My(C) becomes N?-dimensional vectors whose compo- 
nents are their matrix elements with respect to the ONB consisting of the 
eigenvectors of p, |X) = ae (ri |X |r; )|r:) ® |r; ). Also, by varying 
X € My(C), the linear span of vectors of the form | X,/p) is dense in 
CY CN. Let Y = Dija vislri) @ rj) € CN @CN and X = |rp){rql, 
then (4| X @ In | VP) = YžąyTa Therefore, if 7 is orthogonal to the linear 
span of | X,/p) then, Wj, = 0 for all p,q as p is faithful. 

Because of Remark 5.3.2.1, the triplet (CY @ C%,7,|,/p)) is unitarily 
equivalent to the GNS triplet (Hp, Tp, 2,) corresponding to the expectation 
functional wp : My(C) 3 X = w,(X) := Tr(pX). 

The matrix algebra My (C) is represented by My (C) & 1y on H,; so, its 
commutant is 7,(My(C))’ = ln ®My(C), t(My(C)) has trivial center and 
is thus a factor. The action of the commutant is given by 


nf BR Ys (5.145) 


N 
In @X| Vp) =D) Vril ry) 8 (Xlr;)) = 


where XT denotes the transposition of X with respect to the eigenbasis of p. 

We can now look at the decomposers in (5.142) from the point of view 
of Remark 5.3.2.3. Given a convex decomposition p = } jej Aj;0j, every Oj 
corresponds to a unique 0 < X; in the commutant 7(My(C))’, thence to a 
unique 0 < X; € My(C), such that X; = 1n @ X} and 


Ayoi(X) = VB] e(X)X5 VP) = (VB |X @ X} | vB) 
(Jal Xp Xj) = (Va XVP X), (5.146) 
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whence A; = Tr(pX;) and oj = (VP X; ./p)/A; 


Example 5.5.5. Consider a two-level system equipped with the density ma- 


. 1/1-s 0 ‘ 
= — P < < H 
trix p 5 ( 0 RA. 0 < s < 1; as GNS vector we can take its 


purification (5.143) 


Lts 


s 
| Ve) = |0)@|0) + |1)@|1), 
where |0), |1) are the a of p. The corresponding GNS representa- 
tion is 7(M2(C)) = M2(C) 8 lz with GNS Hilbert space C4, 
= s s 
(vel X 81|vp) = (0| X |0) + (1|X|1) = Tr(eX) , 
for all X € M2(C). The commutant is 7,(M2(C))’ = 12 & M2(C) so that 7, 
is reducible and a factor since 7,(M@2(C))” = m,(Mo2(C)) whence its center 
(see Definition 5.3.4) is trivial, Zp = m,(Me2(C))” A m,(Ma(C)) = {Al}. 


Modular Theory 


The GNS state of any faithful density matrix is separating for 7,(M@dy(C)), 
namely 7,(X)|,/p) = 0 = > X = 0, and thus cyclic for the commutant 
Tp(Mn(C))’ (see Lemma 5.3.1). 

We shall now give the fundamentals of the so-called modular theory that 
looks particularly simple for finite-level systems. 

Let p be a faithful states and identify its GNS triplet (Hp, Tp, Rp) with 
(CX @ CX, My(C) @ In,|/p)). The so-called modular conjugation is the 
antilinear map J, : CN @ CN +> CN @CN such that 


E Slr) = Jy) = Saia Y@ |r). (5.147) 


i,j=1 i,j=1 


It satisfies J? = 1 and J,| VP) =| VP); furthermore, 


N 
Jol XVP) = J (X @ Uw) Jol VP) = X, vrir |X lri)" |e) lri) 


ik=1 
= Ay 8 X*| a) =| Vax"), (5.148) 
where X* is the conjugate of X with respect to the ONB {|rj;)}j1, that 


is (r| X*|r;}) = ((re |X |rz))*. Given X,Y,Z € My(C), one explicitly 
computes 
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[ixx @1y)J,, Y @ Iy||Z/p) = 


= J (X 8 In) In| YZ/p) — (Y @ In) Jp(X @ 1n)|/pZ") 
= J| XV/p(YZ)') — (Y @ Iw)| Z/pX") 
=|YZ/px't) —|YZ,/px') =0. 


Thus, J (X @ 1y)J, belongs to the commutant ly ® My(C) = m,(Mn(C))’ 
of My(C) ® In = m,(My(C)); since the GNS vector |,/p) is cyclic for 
Tp)(My(C)) it is separating for 7,(My(C))’. Therefore, from (5.148), 


Jp X @ ln Jp = My @ X*, (5.149) 


whence J, antilinearly embeds 7,(My(C)) into its commutant 7,(My(C))’. 
Actually, the embedding is an anti-isomorphism, 


Ipt(My(C))Jp = 1,(My(C)) . (5.150) 


Indeed, using (5.145), for any S € m,(Mn(C))/ and Z € My(C), it holds 


S'| VB) = Ys VP) = | VA(VBYS =") = Jp (VE vP) © 1x Jol VP) 


1 
vP vP 
where the first inequality is due to the fact that $”|,/p) is a vector in Hp 
which can be obtained by acting on the GNS vector with some 7,(Ys-). Then, 


S= Jp (p YE VP) 8 1x dp 


A related notion is that of modular operator 
Ap := pag” (5.151) 
which, according to (5.148), is such that, 
1 


“al XvB) = Jol VOX) =| XVP). (5152) 


J,A? XVP) = JVP 


5.5.2 Density Matrices and von Neumann Entropy 


From the point of view of the GNS construction, pure states can be dis- 
tinguished from mixed states because their GNS representations are non- 
irreducible factors for the latter, while they are irreducible factors for the 
former. There are however handier ways to sort these states out; perhaps the 
easiest is to consider p°: if more than one eigenvalue of p is non zero, then 
p is mixed since then p? Æ p. Indeed, the spectrum of a mixed state p has a 
richer structure than that of any one-dimensional projection. 
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The possibility of comparing, in some cases, the eigenvalues of two density 
matrices p12 € Bf (H) comes form the so-called minmaz principle of which 
we give a short sketch (for a general formulation and proof see [252]). We 
shall consider the eigenvalues listed in decreasing order; namely, in the spec- 
tral decompositions B} (H) 3 p = X; ri|ri)(ri|, with eigenvalues repeated 
according to their multiplicities, we shall take r; > rj+1, for all i. Because of 
the ordering, it turns out that 


ry =sup{ (wl plb) : [lll =1, 1%) L {lri} r2) 1ri1)}} (6-153) 


Indeed, for a 7 specified as above, (|p|) = ops; rel (Y| re) |? < ri and 
ri is achieved by choosing |) = |r; ). The minmaz principle asserts that 


r= inf sup{(blolw) : [lel =1, 1d) L {I r),]42),--- 51-1) Fb , 
{i} 
(5.154) 


where {¢ġ; eae is any set of vectors in H. 

In order to prove this relation, we denote by U,({@, =) the argument 
of the inf and show that U,({¢; D > ri, whence the result follows for r; is 
achieved by choosing | ¢;) = |r; }, j = 1,2,...,i— 1. Now, there surely exists 
a normalized vector |W) = Yai cel Tr) L {ġj = indeed, if P projects 
onto the linear span of {| r; )}5—1, the vectors {P| ¢e) iZi Span at most an 
(i — 1)-dimensional subspace. But then, 

i i 
Utk) > (2 lolt) =X rr ler? = ri dD le? = ri - 
k=1 k=1 
From the minmax principle, it follows that pı > po => e;(p1) > e:lp2), 
where e;(p) is the i-th one in the ordered list of eigenvalues of p. Further, for 
generic p12 € BI (H), the minmax principle provides an upper bound to the 
differences |e;(~1) — e;(p2)| in terms of the trace-norm (5.21). 


Example 5.5.6. Given p12 € Bİ (H), decompose pı — p2 = R} — R_, where 
R4 are positive orthogonal operators, so that 


e1 — pall, = Tr(Ry + R-) = Tr(2R — pı — p2) , 


where R := pi + R- = pa + Ry > pip- Let ri, respectively ri? be the 
eigenvalues of R, respectively p1,2 listed in decreasing order; then, by the 


minmax principle, r; > r}? for all i. Thus, 2r; — r} — r? > |r} — r?| => 
Llr — ril < lle — ella 


The spectrum of a density matrix is a classical probability distribution: 
this hints at the possibility of quantifying its information content of by means 
of the Shannon entropy of such a distribution. This leads to the notion of 
von Neumann entropy of a state p € B] (H) [226]. 
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Definition 5.5.2 (von Neumann Entropy). 

Given p E€ S(S) with spectral decomposition (degenerate eigenvalues are 
repeated according with their multiplicity and with chosen orthogonal one- 
dimensional eigenprojectors) p = X`; 73 |T;)(Tj|, the von Neumann entropy 
of p is the Shannon entropy of the probability distribution corresponding to 
its spectrum: S(p) = —Tr(plogp) = — 2; rjlogrj. 


The following are some of the properties of the von Neumann entropy: 
they show that it plays a role similar to that of the Shannon entropy in a 
classical context. Other properties more related to composite systems will be 
discussed in Section 5.5.3. 


Proposition 5.5.3. Let p € B} (H) be a density matriz. If H = C%, then 
the von Neumann entropy is bounded by the entropy of the state In /N: 


0< S(p) <logN. (5.155) 


The von Neumann entropy is concave; that is, given weights \; > 0, i €I, 
Dicer Xi = 1 and density matrices pi € BY (H), 


5 AS (pi) < S (= wn] < S > AS (pi) F Xn) ; (5.156) 


ie. ie] wel wel 


where is the concave function (2.84). IfH = C%, the von Neumann entropy 
is continuous on + (A) with respect to the trace-norm; namely, if pi, € 
By (H) are such that ||p1 — pall: < 1/e; then they satisfy the so-called Fannes 
inequality 


|S(p1) — S(p2)| < |le1 — pllilog N + n((le1 — palla) - (5.157) 


Proof: Since S(p) —logN = — > ri(logr; — log1/N), boundedness 
follows from (2.85). The lower bound in (5.156) comes from the concavity 
of n(x); indeed, let rj and |r; ), respectively rj and |r) be eigenvalues and 


eigenvectors of p := jez Aipi, respectively p;. Then, 


rj = Dd 15 | pi Ir5) =D Del Krk lea)? 
tel tel 
with Jez p Ai |( ri |r; )|? = 1. Therefore, 
p) = Sonlrs) A Aril nh) =X AA nr) = YAS (o). 
j iEI,k wel k on) 


On the other hand, from Example (5.2.3).9, 
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Api < p => Aipilog(Aipi) = Aipi log pi — Na) < Aipilog p . 


Fannes inequality follows from the fact that |n(u) — 7(v)| < n(\u — v|) if 
|u — v| < 1/e and from Example 5.5.6 which implies that 


N 


sk = [rh = rèl < >) sk =: S < lloi- pelli < 1/e , 
k=1 


1,2 n 
where r,“ are the ordered eigenvalues of p1,2. Then, 


N N N 
IS (a) -S (e)l < XO nerh -nra Anla) = S Sa + n(5) 
k=1 k=1 


k=1 
< ||. — pallalog N + nllo — paella) ; 


for n(x) increases when x € [0,1/e]. We complete the proof by showing that 
Inu) — n(v)| < n(\u — v|) indeed holds when |u — v| < 1/e (see [222]). The 
function f(x) := n(x + (u — v)) — n(x) decreases for u — v > 0, thus f(0) = 
nlu — v) > f(v) = nlu) — n(v). If u — v < 1/e, nlu — v) > u — v, while the 
increasing function g(t) := t+ n(t) gives glu) = u+n(u) > v+n(v) and thus 
nlu) — n(v) > v — u which implies ņn(u — v) > n(v) — n(u). 


Remarks 5.5.5. 1. The second inequality in (5.156) becomes an equality 
if and only if the ranges of the matrices p; are orthogonal to each other; 
indeed, in such a case the eigenvectors of different p;’s are orthogonal so 
that their spectral decompositions give the spectral decomposition of p, 


p= >> dril rt) (rk | = 8 (p) = Do Arf log(iirF) . 
ik ik 


2. In the case of an infinite dimensional Hilbert space, the von Neumann 
entropy is only lower semicontinuous: if a sequence of density matrices 
On tends to a density matrix ø in trace norm, then S (ca) < lim, S (on), 


in general. As an example [300], take on := (1 — —)p + — pn, where 
n 


S(p) =0 and S (pn) increases like n. Then, ||on — p||i < 2/n — 0 when 
n — +00; also, by (5.156), 


1 1 1 
8 (on) > (L= =) $(p) + ŽS (Pn) = —S (pn) >c > 0=8 (p) . 
The least mixed states, the pure states, are 1-dimensional projectors for 
which rı = 1 while all other states have rı < 1. This suggests the following 


Definition 5.5.1. /300] A density matriz pı € S(S) is said to be more mixed 
than another density matrix pa € S(S), pı = p2, if their decreasingly ordered 
eigenvalues ei(p1,2), j =1,2,...,d, satisfy 
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The relation = is a total ordering among density matrices of two level 
systems; this is because e€1 (1,2) +€2(p1,2) = 1 for any pair of density matrices 
1,2 E Mo(C). Therefore, pı = po <=> e1(p1) < e1(p2). 

Unfortunately, for higher dimensional systems > is only a partial ordering; 
for instance, consider the following density matrices p1,2 E€ M3(C), 


1/2 0 0 2/3 0 0 
p=| 0 12 0|, =| 0 16 0 
0 0 0 0 0 1/6 


Then, e1(p1) = 1/2 < e1(p2) = 2/3, but e1(p1) + e2(p1) > e1 (62) + e2(p2). 
The following proposition provides a helpful tool that allows, in some 
cases, to establish whether two density matrices are in the > order. 


Proposition 5.5.4 (Ky Fan Inequality). Given p € B} (H), it holds that 


i 
Y e;(o) = max{ Tr(Pp) : P?=P= P? , dim(PH) = it . (5.158) 


j=1 


Proof: Let {|¢;) Ea be an orthonormal set in H, K the subspace they 


generate and P := Eii | dj )( Q; | the corresponding orthogonal projector. If 
k k 
{| Ji) H1 is any other ONB in K, then $°(4;|p1¢;) = X (4 |p lyy). Let 
j=l j=l 
|r; ) be the eigenprojectors of p corresponding to the eigenvalues e;(p) =: r; 
listed in decreasing order and consider the subspace spanned by {|7r;)}*7. A 
same argument as in the proof of the minmax principle (5.154), ensures the 
existence of some |Y} € K orthogonal to it; analogously, there must exist 


|ve-1) EK L {|r} Hoy Ul de} 


and so on. Thus, one collects {| Y; )}4_, € K such that 
li) L {lri} Irj) l Vt) 


and (5.154) yields Tr(P p) = Di1 (Vi lo lti) < Ei ea). 
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Examples 5.5.7. 


1. Given any p € S(S) and unitary matrices U; € My (C), j € J, together 


with weights 0 < Aj < 1, DyeyAy = 1, set B= jez àj Uj pU}. If 

p= S ri|ri)( ril, the unitarily rotated matrices U; pU} have the 
N 

same spectrum for U; pU} = lee and |7;) := U|r;) form an 
i=l 

ONB . Further, their convex combination p = p; indeed, let P be the 

projector achieving the maximum in (5.158), aa ei(p) = Tr(Pp); then 


k k k 
X e) = x Aj Tr(U} PU, pf) < DD Aj) 3 eilp J=) el) 


i=1 jEJ jEJ 


for the projectors Uj PU; need not achieve the maximum in (5.158). 


. Consider the convex set Bý (H) of all density matrices of a system S 
described by a Hilbert space H, not necessarily finite dimensional, and 
let Sora(S) be totally ordered: the most mixed p € So;q(S) have the 
largest entropy [300, 314]. Indeed, pı = p2 => S(p1) > S(p2). In order 
to show this, set a; := e;(p1); then, for all N > 1 and ay > 0, 


N-i k N 


OL ai) (log ax, — log ak+1) + logan = So or log ay, . 
k=1 i=1 k=1 


Set 6i := e;(p2); since pı = po => = 1 ai S Sul , ĝi for all k > 1, 
using (2.85), one finds 


N N-1 k 


Xo -or logak > — 5 © 6) (log ag — log ak+1) + logan 
k=1 k=1 ‘i=l 
= -Y ict >- 5 Be log 6, + x (Bk — ax) , 


k=1 k=1 


for all N > 1, whence S (p1) > S (p2), for >>, bk = X pak = 1. 


5.5.3 Composite Systems 


In quantum information, physical systems § consisting of several subsystems, 
S = Si + S2 +- Sn, are called multi-partite. 

If each of the constituent subsystems is described by a Hilbert space H;, 
the Hilbert space of S is H™ = @/_, H; and its observables are Hermitian 


elements of the C* algebra B(H‘)) = 


Qiz 


B(H;). 
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Given a multi-partite state p E€ S(S), marginal states Piisi, for all pos- 
sible choices of subsystems S;, + Si, +...+.5;, are obtained by partial tracing 
over the Hilbert spaces He whose indices are different from the selected ones 
71, i2,..., ik, namely 


Piyio--in = Try Tess Fs ‘Trin (P) ; Je F 24,122, .. stR 5 l= 1,2: TTTS i k ; 


where Tr;(p) = 2 RO | p we D denotes the trace computed with respect 


to any ONB 4| yË D) € Hj and yields a density matrix acting on the Hilbert 
HI je = Oli iF 


In particular, ‘ne states of bipartite systems S = Sı + S2, are described 
by density matrices p12 € B (HC) with marginal states 


pı = Trapi2 := > (ve? | P12 YP) , p2 = Tripi2 := Coe | p12 [we ) 


j i 


Proposition 5.5.5. The marginal states of any pure state p12 € S(S1 + S2) 
have the same eigenvalues with the same multiplicity, apart from the zero 
eigenvalue, and thus the same von Neumann entropy. 


Proof: Let | W12) € H be the vector onto which the pure state p12 projects 
and r®, | p ), the non-zero eigenvalues (repeated according to their multi- 
plicities) and eigenvectors of the marginal density matrix pı = Trap. Using 
the corresponding ONB {| r® )} in Hy and any other ONB {| pP )} in Ho, 
one can expand 


IYi2) = D> Culr{?) 14) = Tir) @ 14), 
j,k F 


where 19? ):= 0, Coe oP ) need not be either orthogonal or normalized. 
Then, 


m= ole Me l= ler lee dire Meg | 


j j,k 


whence (of 2 g@ y= Sjer? ), Setting j yim | 9?) )/ D yields 


| Y12) =e fr? Iri ae Dy, (5.159) 


It thus follows that p2 = >, e] p” x r?) |, whence S (p1) = S (p2). 
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Remark 5.5.6. The expression (5.159) yields the so-called Schmidt decom- 
position of bipartite pure states | V12) € Hı ® Ho into a linear combination 
with positive coefficients of tensor products of equally indexed states from two 
ONBs of the two subsystems. The degeneracy of the 0 eigenvalue accounts 
for the possibly different dimensions of the Hilbert spaces H 2. 


Example 5.5.8. Let a bipartite system S = Sı + S2 consist of a 2-level 
system Sı and an N-levl system S2. Let {X;}?_, E€ M2(C) be a set of matrices 
such that paar x) Xi = lle, consider a fixed vector y € C? and the vector 
state C? 9 CN > |W) := iL, Xi] bh) 8 |i), where {|2)}%, is an ONB for 
S2. The normalized vector | ¥ } yields marginal states 


N 
M,(C) 3 pı = Tral|¥ (Ul) = X Xv IX 
N 


Mn(C) 3 p = T(P P) = So IX} Xil) 14) (4 - 
ij=l 
Let ra, |a), a = 1,2, be the eigenvalues, respectively eigenvectors of p1; by 
expanding X;| Y) = 37?_, cia|a), it turns out that |W) = 57?_, |a)@|¢a), 
where | Qa) := ak Cia |i). The vectors | ġa ) := | ba )/||¢al| are orthonormal 
and the 0 eigenvalue of p2 = r1| 1 )( $1 | +72] 92) (¢2 | is (N — 2)-degenerate. 


We end this section with a list of properties of the von Neumann entropy 


which pertain to composite systems. 


Proposition 5.5.6. Let S = Sı+ Sə be a composite system with Hilbert space 
H = Hy ® H2, Hı 2 of dimension dı 2. The von Neumann entropy is additive 
on product states By (H) 3 pi2 = p1 8 P2, 


S (p12) = S (p1) + S (p2) - (5.160) 


Given pı2 € B} (H), let BS (Hy) 3 p1 := Tr2p12 and BY (Ho) 3 p2 := Tripi2 
be the marginal states. Then 


|S(p1) — S(p2)| < S(p12) < S(p1) + S(p2) ; (5.161) 


The second inequality expresses that von Neumann entropy is subaddi- 
tive; more in general, the von Neumann entropy is strongly subadditive. 
Namely, let S = Sı + S2 + S3 be a tripartite system described by a state 
p123 € B (H) with H = Hı @ Hy @ Hz. For any cyclic permutation (i, j, k) 
of (1,2,3) let By (H; ® H;) > pi; := Trkp123, By (Hi) > pi := Trjkpi23 be 
marginal states. Then [192, 314] 


S(p123) + S(pj) < S(eiz) + S(pjx) - (5.162) 
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Furthermore, the differences between the von Neumann entropies of pig and 
of the marginal states p12, S(pi2) — S (p1,2), are concave: 


Fare | — 3 | So an] = a(s (2) - S (0/3) , (5-163) 
j j j 
where àj > 0 and >), A; =1. 


Proof: Additivity comes from the fact that the spectrum of p12 = p1 ® p2 
consists of the products of the eigenvalues of pı and po. 

Assume strong subadditivity holds and let pag = >>, r 
a density matrix on H4 ® Hpg and 


AP ef? (nA? | be 


E Lyri” Ir AB) @ |rA®) € (Ha 2 Hp) & (Ha @ Ha) 


the corresponding GNS state. Set Hı := H4, Hə := Hpg, in the first factor, 
H; := H4 ® Hpg in the second one and 


nepie TABAB riB Nr F Blr i | 


Then, p3 = Trı12p123 = pap = Tr3p123 = p12, therefore, pı = Tr23p123 = PA, 
p2 = Trı3p123 = pB. Also, because of purity, S(p123) = 0, thus (5.162) and 
Proposition 5.5.5 yield 


S(paB) = S(p3) < S(p13) + S(p23) = S(p2) + S(p1) = S(pa) + S(pB) 


which implies subadditivity. The lower bound in (5.161) follows instead from 


S(pa) = S(p1) < S(p13) + S(p12) = S (p2) + S(p3) = S(pB) + S(paB) 
S(pB) = S(p2) < S(p12) + S(p23) = S(p3) + $(p1) = S(paB) + S(pa) - 


In order to prove strong subadditivity, we introduce the quantum relative 
entropy of two density matrices p and ø (see Definition 6.3.1) 


S 
S 


S(p,o):= Tr(p log p — p loga) 
which is well defined when o| y) = 0 => p|w) = 0. 


We shall prove in Section 6.3 that S (p, 0) does not increase under maps 
like the partial traces, thence 


1 1 1 
S (ow, n @ p) =5 (Tespis, ie @ px) <S (ve. a @ px) : 


On the other hand, 


222 5 Quantum Mechanics of Finite Degrees of Freedom 
lı 
S| p12, 7 8 P2) =—S (p12) + S (p2) +logdı and 
1 


1 
S (+s ; a & pas) = —S (p123) + S (p23) + log dı , 


whence (s (p123) — S (p2s)) -= (s (P12) — $ (p2)) <0. 


The concavity (5.163) follows from another property of the relative en- 
tropy which shall be discussed in Section 6.3, namely its joint convexity: 


S |2 Aal od DY <E as (0 , 0) 
j 


y jo l 
where A; > 0 and >), à; = 1. Let pO i= pW and o) := = pP ® i with 
2 


pP: D) a = Trop) then, 


S| Do Melb Ae @ = 
-5 Delt os Dt + log də 


eX as (oho! z)= aie (of?) — 5 (o®)) + log da . 


J 


5.5.4 Entangled States 


One of the most puzzling and fascinating aspects of quantum mechanics is its 
non-locality embodied by the concept of quantum entanglement [152]. This 
is a property of certain quantum states of composite systems, called entan- 
gled, which are such that their constituting subsystems cannot be attributed 
properties of their own, not even with a certain probability. As a paradigm 
of such states, consider the following vector state of two spin 1/2 particles 
(the simplest instance of the vector states (5.18) in Example 5.2.3.9), 


|00) + | 11) 

V2 3 
where |0) and |1) are eigenstates of the Pauli matrix o3. By looking at the 
corresponding projector, 


| Yoo) := (5.164) 
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1 
|Yoo) (Woo | = 5 (0)(0| @10)(0] +11)(1] @11)(11) 


+ F(10X11810)(11+11X0]811)(0]) , 


one sees that, while the first line is the density matrix of an equally distributed 
mixture of both spins pointing up and down along the z direction, the in- 
terference term in the second line forbids to attribute these two properties 
with probability 1/2 to the component spins. By rotating the orthonormal 
projectors (103) /2 into any two other orthonormal pairs (1+n,-o)/2 and 
(L n2-o)/2, the same obstruction occurs along any two directions n1 2. 


Example 5.5.9 (Bell States). [224] The symmetric vector (5.164) is the 
first one in the so-called Bell basis of C? @ C? of which the others read 

r |01) +] 10) á |00) —|11) r |00) —|11) 

Wo. ) = ———— , |Zo) = ——— , [k ) = ——— _... 

| Yo. ) Ji | “10 ) fa | %11) T 
In quantum information 2-level systems are called qubits, unitary actions on 
them quantum gates and nets of unitary gates quantum circuits. The Bell 
states can be created out of a separable pure state of two qubits by means 


of local and non-local operations, according to the quantum circuit in Figure 
5.4. 


Un 


| Vay) 


Ucnor 
Fig. 5.4. Bell States 


The input vectors |x) and |y}, x,y = 0,1, are members of the so-called 
computational basis; |x), called control qubit, is subjected to a Hadamard 
unitary rotation (see (5.58)), Uz, and then, together with | y), called target 
qubit, to a so-called Control-Not unitary gate, Ucnor. The first transforma- 
tion affects one of the two qubits only via the matrix M4(C) 3 Up @ lz and 
is thus local; the second one involves both qubits in a non-local way. Indeed, 
the unitary matrix Ucnor € M4(C) implements the classical CNOT gate, 
CNOT(z,y) = (x, y®2), that acting on pairs (x,y) of bits leaves the control 
bit unchanged and adds it to the target bit: 
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CNOT(00) = (00) CNOT(01) = (01) 


CNOT(10) = (11) CNOT(11) = (10) 


If we substitute pairs of bits with tensor products of computational basis 
vectors, the Pauli matrix gı flips the |x} so that the same relations are 
implemented on C? @ C? by 


onor =|0M0]@ 1+ |1)(1]e01= (4 d i (5.165) 
|x) |x) 


Ucnor 
Fig. 5.5. CNOT Gate 
Let |W) := UcnorH & 1| zy); by a Hadamard rotation (5.58), one gets 


s ye Lyne yes) ON t(criLyel) 
ov) = Fe DL 1) | „ya y= a ; 


and, by varying x,y € {0,1}, one obtains the Bell basis. 


Entanglement is a purely quantum phenomenon, with no classical coun- 
terpart; it has from the start attracted a lot of scientific and, unfortunately, 
also pseudo-scientific interest; one of the great merits of quantum informa- 
tion is to have promoted entanglement to the status of a physical resource 
for performing informational and computational tasks otherwise impossible 
in a purely commutative context. 

In the following, we shall mainly focus upon bipartite discrete quantum 
systems consisting of two parties described by means of finite dimensional 
Hilbert spaces C” and C®, respectively. Within the state-space S(S1 + S2), 
one distinguishes separable from entangled states. 


Definition 5.5.3 (Separable and Entangled States). A density matrix 
p € S(S; + S2) is separable if and only if it can be approximated in trace 
norm by a linear convex combination of tensor products of density matrices: 
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p= YS. Aves Or: MGSO, YY Bag iu (5.166) 


(i1,t2)ETi X I2 (i1,i2)€I1 X I2 


Those p € S(Sı + S2) which cannot be written in a factorized form as 
in (5.166) are called entangled or non-separable states. 


Remark 5.5.7. Pure separable states are of the form |Y) = | Y1) 8| Y2) for 
some Yı 2 € C%2, otherwise they are entangled. The set Ssep(S) of separable 
states of the bipartite system S is the closure of the convex hull of its separable 
pure states (see Remark 5.3.2.5). 


In order to judge whether a pure bipartite state is entangled or separable 
is sufficient to look at its marginal density matrices. 


Proposition 5.5.7. A vector state |W.) E C% @C® of a bipartite system 
Sı + S2 is separable if and only if its the marginal states pı,2 are pure. 


Proof: If |%2) is separable, its projector is the tensor product of two 
projectors and partial tracing yields one of them. Vice versa, if the marginal 
density matrices are not projectors, then the Hilbert-Schmidt decomposi- 
tion (5.159) contains more than one pair and | %»2) is entangled. 


The structure of separable states is apparent from (5.166): they can be 
obtained by mixing with weights \;,;, otherwise independent states of Sı 
and S2, the only possible correlations between them being those relative to 
the probability distribution {A;,;,} associated with the weights and thus of 
purely classical nature. Instead, pure entangled states carry correlations that 
are purely quantum mechanical. 


Examples 5.5.10. 


1. Consider a bipartite system consisting of two d-level systems in the 
state (5.18) which generalizes the two qubit symmetric state (5.164). 
From partial tracing the projector P4 = |W¢)(W4|, one gets 


d 
nae 1 
Hiesz 


namely the totally mixed state for both parties. Thus, the von Neu- 
mann entropy of the bipartite state Pe is smaller than that of either 


Pi = p2 = 


ale 


its components, 0 = S$ (F) < S(p1,2) = logd, which is maximal, in- 
stead. In other terms, the information content of the entangled pure 
state Pe is smaller than that of its constituent parties. This holds for 
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all entangled pure states. In order to see this, one uses the Schmidt- 
decomposition (5.159) and Proposition 5.5.5: if p12 = | V12 )( Yio |, then 
S(piz) = 0, and S(p1) = S(p2) = ee 17; logy ri > 0 unless p1,2 are 
pure states and thus p12 separable. 


. The previous observation makes the statistical properties of pure entan- 


gled states incompatible with classical ones; indeed, by (2.88) one knows 
that the Shannon entropy of a bipartite classical system (described by 
two random variables) cannot be less than that of any of its marginal dis- 
tributions. This classical behavior is characteristic of all separable states; 
namely, the von Neumann entropy of all separable bipartite states cannot 
be smaller than the von Neumann entropy of their marginal states. This 
fact follows from (5.163); in fact, consider a separable state 


12 = >> Agel? © pf € BY (Hi ® Ha) 


a 


so that py = >, AP pf, AY == DO, Aig; then, by applying (5.163), (5.160) 
and the positivity of the von Neumann entropy (see (5.155)), one gets 


S (p12) — S (pr) > rs (s (eP) -5 (®)) 
z Sas (P) >0 
ij 


. The so-called GHZ states are entangled pure states of tripartite systems 


(000) +] 111) 


consisting of 3 qubits : |4) := in the computational 


basis. Though entangled as tripartite states, all their two qubit marginal 
states are separable, for instance 


1 


1 
1 sees 1 
pis = 5 Tre D> eD |’) Gad) = 5 AHL @ LEAL. 


i, j=0 i=0 


From |84 ) one obtains an ONB in H®) = (C?)®3 by acting locally with 
the Pauli matrices, 


|Yarc) = 02 Q o? @aS| G1), a,b,c =0,1 
1 
1 E . at 
(Baes (Base) = 3 Di (iloi |i) (ilog li) (ilosta) 
— 


i,j=0 ea 


1 
1 r rom a 
= 5 (1 (al oft jt) Ca] of? ji) = Saad beds - 
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5.6 Dynamics and State-Transformations 


The standard time-evolution of quantum systems is typically described by 
a strongly continuous one-parameter family of unitary operators {U+ }reR on 
a Hilbert space H, fulfilling the group composition law UUs = Ut+s, for all 
t,s € R. By Stone’s theorem [300], the group is generated by a self-adjoint 
operator H on H, the Hamiltonian, such that, for all y € H, (ñ = 1) 


Ald) = iH p), (e) =U), Uae. (5.167) 


This type of time-evolution equation is proper to the so-called closed quan- 
tum systems. As any other physical system, also closed systems S are in 
contact with the environment E which contains them; however their mutual 
interactions are negligible and the dynamics of S is independent of E and 
is reversible. When the interactions between S and E cannot be neglected, 
it may nevertheless be possible to derive a closed dynamics for the system 
S alone which nevertheless accounts for the presence of the environment. In 
such cases, S is known as an open quantum system and its so-called reduced 
dynamics is irreversible and incorporates noisy and dissipative effects due to 
the presence of E. 

The Schrödinger time-evolution easily extends from vector states to mix- 
tures. Since pure states | w)( | evolve into pure states U;| w)(w |U}, exten- 
sion to convex combinations of pure states yields the Liouville equation 


3p = —i |H, pı] , (5.168) 


with formal solution p; = U; p Už for any initial state p € B} (H). By duality 
(compare (2.9) and (2.7)), if X € B(H) then Tr(p X) = Tr(p X:+) and one 
gets the Heisenberg time-evolution equation for the operators 


3X: =i |H, x,| (5.169) 


This gives rise to a one parameter family {U;}:eR of automorphisms of B(H), 


XU, [X]:= X; =U) XU, , (5.170) 


that preserve hermiticity and products of operators, 


UX] = uX] p, U[XY]=uU,[xX]U,[Y] V X,Y ¢ BH). 


As automorphisms, these linear maps are positive, and also completely 
positive as their action is of the Kraus-type discussed in Proposition 5.2.1. 
By duality the action of U is transferred to the action of the dual Uf on 
BY (H): U;*[p] = UrpU!; Uz preserves the trace, Tr(%;[p]) = 1, and sends 
projectors into projectors, 
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P? = P = Pt = (UF [P])? = UF [P] = u [PI . 


A state p is an equilibrium state if and only if it commutes with the Hamil- 
tonian that generates the dynamics, Uf (p) = p & [H, p] = 0. However, 
if a state p changes in the course of time under a time-evolution imple- 
mented by unitary operators, its spectrum does not. As a consequence, as 
much as the Gibbs entropy of classical probability distributions evolving 
under a Hamiltonian flux on phase-space is a constant of the motion, the 
von Neumann entropy is always preserved by the Schrödinger-Liouville time- 
evolution, $(U;*[p]) = S(p). 


Examples 5.6.1. 


1. We have seen that density matrices p of 2-level systems are identified 
by their Bloch vectors p € RÌ. By denoting them as kets | p), the linear 
action of the commutator on the right hand side of (5.168) corresponds to 
a 3x3 matrix acting on | p}, whence the Liouville equation can be recast 
in the form 0;|p) = —2H |p). Since [1, p] = 0, it is no restriction to 
take the Hamiltonian of the form H = w-o with w = (w1, w2,w3) € R3, 
o = (01, 02,03). Then, the algebraic relations (5.56) yield 


dp i 3 0 w3 —W2 
a 
IE = 2 2 EijkWjPk => H= —w3 0 Wy 
j,k=1 0 W2 —Wwı 
Thus, Bloch vectors rotate with angular velocity w = ||w|| around the 


direction of w; their lengths are then constant, pure states remain pure 
and the surface of the Bloch sphere is mapped into itself. 
Suppose w = (0,0,w), then, by series expansion, 


U; = #03 = coswt + io3sinut ; 
thus, o3(t) := Ufos U; = 03, while 
o4 (t) := UÏ o4 U = 04 (cos 2wt + i sin 2wt) = o4 e . 
The same result more directly follows from the fact that 
oka, =o}! o} 03 + 204 = c4 (03 +2)* 


and that this relation extends to functions f(ø3) that can be expanded 
as power series, namely f(o3)o4 = 04f(o3 + 2). 
2. Consider an array of N spins 1/2 equipped with the Hamiltonian [300] 


N N N-1 N 


Hy =~ Bpo +X Y eli) oi ot = NO (H; + Hi"), 


j=1 j=1 i=1 j=1 
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where, as in Example 5.4.1, gf denotes the Pauli matrix o3 at site j 
and products of Pauli matrices at different sites denote tensor products 
of commuting operators. The single sum corresponds to the spins being 
coupled to a vertical constant magnetic field B, while the double sum 
describes spin-spin interactions whose range is regulated by the coupling 
constants e(i) which only depend on the distance between spins. 

Let the array be provided with periodic boundary conditions oy = 
(o# = 03,4); then, the j-th spin interacts with the same strength with 
those symmetrically placed on its left and right hand side. Suppose N 
odd, it follows that Hi” can be recast as 


+N 


(N-1)/2 
Hy = 5 eli) (oio; + gigi) . 
i=1 
Using the previous example and the fact that o}, commutes with all spin 
operators from sites different from k, one obtains 
oÍ (t) := eN gi etn = gi (5.171) 


itHn „Í a—itHN _ eit (Hi + Hj”) A e it(Hi+H;”") 


a(t) = oe 


= of eH Bmt Sr P (lo +07") 


| (N-1)/2 
= gf er II ( (cos 2te(i) +o sin 2e(i)) x 


i=1 


x (cos 2teli) + of +* sin 2<(i))) (5.172) 


while ø? (t) is obtained by taking the adjoint of oi (t). Let the spin system 
be endowed with a state which is the tensor product of equal pure states 
as in (5.104) each of them for each one of the sites 


N : 
1 1l+s V1 — set? 
P=, p= ( 


2\ VI — s2e7*? l—s 


This state is not invariant under the time-automorphism in (5.171) 
and (5.172): indeed, p®% (o4(t)) = s for all j, but 


n=l 


(N—1)/2 
PN loH) = 2m ploi) TI ( (cos 2te(i) + p(o’) sin 2e(i)) x 
x (cos 2te(i) + p(oft*) sin 2e(i))) 
2itBu; V l= 


8? 5 
=e 2 f(s.) ? 
(N-1)/2 


f(s, t) = JI (cos 2e(i)t + is sin 2e(i)t) : 


i=1 


230 5 Quantum Mechanics of Finite Degrees of Freedom 


If the coupling constants decrease exponentially with the spin distance, 
e(i) = 2-“+), and s = 0, then 


(N-1)/2 


fn(0,t) = II cos = 


i=l 


and the observables gł 


show a recurrence time that increases as 20 -71)/2, 


gy i 
eis 5 fO, t). (5.173) 
3. Consider f uncoupled harmonic oscillators of masses m = 1 and fre- 


quencies w; described by the algebra of Weyl operators (5.75) W (r), 
r = (q, p) € R*/ and by the Hamiltonian operator 


Using (5.68), the Heisenberg equations of motion (5.169) for position and 
momentum operators T = (q, p) read 


dq . dp A 
-5 = —— =) 
dt "dt a3 
where 9? is the diagonal f x f matrix 2? = diag(wj,w3,...,w7). They 
are solved by 
ee ae oe ae E cos Rt  NQ-!sin Nt 
fi = Uir] = E RAP AS E Nt cos (2t 


Because of linearity, it turns out that the time-evolution maps Weyl op- 
erators into Weyl operators, 
Wr) = e 7) o UIW (r)] =: Wir) = et (AF) = eilr) (21°) 
= W (ri), 


(5.174) 


where r, solves the Hamilton equations for f classical harmonic oscilla- 


tors. Using the notation (5.95), one passes to annihilation and creation 
operators via the relations 


1 qQi/2 i Q—1/2 E 
= Va Que —in-1/2]T: 
1 f 
The Hamiltonian then reads H = 5 5 Wi alai, so that 
i=1 
a(t) = Uilai] = ae ™™ , 


al(t) =Ublal] = af el. 
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4. Example 5.4.2 provides the right algebraic setting for quantizing classical 
dynamical systems as those studied in Example 2.1.3. As much as in that 
case, the quantum time-evolution will be described by a one-parameter 
group {0% hez consisting of integer powers of an automorphism Oy : 
My(C) = My(C). 

Let A = p A of Example 2.1.3 be an evolution matrix with integer 
components and determinant equal to one. According to (2.22), the time- 
evolution of the exponential functions reads 


(Uaen)(r) = e?Tiw lAr) — emi (ATn)r _ eATn(T), AT = K : : 


If one identifies em with Wo(m), then the discrete Weyl relations (5.85) 
can be read as a non-commutative deformation of the fact that the ex- 
ponential functions commute: 


It is thus natural to define the automorphism as 
Oy |Wy(n)] = Wn (A? n) , (5.176) 


and extend it linearly to the whole of My(C). 
In order to be an automorphism, O has to fulfil Oy [1]; then, from (5.84) 
and (5.86), 


On[UN] = e?7*% = Wy(NA(1,0)) = Wn (N(a,d)) 
= e27 (aay +bay— Fab) 


On [VAY] = 7t% = Wy (NA(0, 1)) = Wn (N (c, d)) 


= e27 i(cau+day— Fed) ; 


It thus follows that a discrete representation Wy," of the CCR has to 
be chosen such that 


HORORO 


For instance, when in Example 2.1.3 a = —1, then A = a 5) = AP 


and the choice a,,, = N/2 mod 1 yields a finite-dimensional quantization 
of the Arnold Cat Map [98, 135, 135]. 

Furthermore, the automorphism Oy is implemented by a unitary opera- 
tor Sy : CN + CN which can be determined by means of the equation 


(5.89) once it is represented with respect to the chosen basis {| j ) pare 
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N-1 
(k| Sy le) = ra ner 
p.q=0 
Sktpq = 7 DD p|Wn(=n) la) (k|Wn((=n) |£) . 
N eh, 


Thermal States 


In the following, we shall focus upon states of quantum systems that are 
left invariant by the dynamics, namely we shall be interested in equilibrium 
states. A well-known class of such states is represented by the thermal or 
Gibbs states: ; 
eee 


pp = -z 


, Baie Tehet) , (5.177) 


at inverse temperature T7! = 8 relative to a Hamiltonian H. Let us consider 
a finite level system and the two-point time-correlation functions 


Fyy(t) = Te (pp U(X] Y) , Gxy(t):= Tr(p9 Y U[X]) (5.178) 


for all X,Y € My(C), where the dynamical maps U, t € R, are generated 
by (5.169). Simple manipulations based on the cyclicity of the trace show 
that 


Fos) = Dr tl) = Door tg) 


= Tr(p3 oi H (+8) atnm = Gxy(t+iß). (5.179) 


The above equality expresses the Kubo-Marting-Schwinger (KMS ) condi- 
tions in their simplest form [183, 203] and Gibbs states as in (5.177) are the 
simplest instances of KMS states. 


Remarks 5.6.1. 


1. Only the Gibbs state pg € Myn (C) can have two-point correlation func- 
tions satisfying (5.179); in fact, 


Tr(pXY) = Tr(p¥ Use X1) = Tr(UislX] eY) 
for all Y € My(C) yields 
pX =e PH X FF p => [ef p, X] =0 


for all X € My(C) whence p x e 8". If the KMS condition are taken 
as a signature of thermal equilibrium, the conclusion to be drawn from 
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this example is that for finite degrees of freedom there can be only one 
equilibrium state at a given temperature. Therefore, in order to mathe- 
matically describe phase-transitions one needs infinitely many degrees of 
freedom [300]. 

. At infinite temperature 3 = 0 and the Gibbs states reduce to a tracial 
state (see Example 5.3.3.3): T(X) = Tr(# X). 

. We have seen that faithful density matrices p > 0 are naturally associated 
with the modular operator (5.151). The modular operator defines the 
modular automorphisms of : Tp(Mn(C)) => m(Mn(C)), t € R, given by 


oP lp X)] = Ait p(X) A = pi" @ pot’ n,(X) t @ p-**. (5.180) 


They from a group, the modular group, and preserve the GNS state, 
of|./p) = | vP), so that (5.152) reads 


0° i2(™(X))| VP) = Jom o(X)'1 VP) =|VPX) . 


Further, it turns out that p is a KMS state at inverse temperature @ = 1 
with respect to a” ,, 


(Vil o2 eltp(X)Ito(¥) vp) = T(P- X pity) 
= Tr(p¥ p49 x pitt) = (Vp | mp(¥)0? ey lto(X)] lve) - 


By means of the modular group, when a faithful p is decomposed into a 
linear convex combination of other density matrices 0;, p = >> j Ajoj, its 
decomposers in (5.146) can be recast as follows, 


j05(X) = (VP | p(X) lV PX; ) = (vPI T(X) 08 52h o(X5)] VP) 
= (VP | oF 2[™(X5)] TX) VP) - (5.181) 


The two-point correlation functions Fyy (t), respectively Gyy(t) can be 


analytically extended to the strip {t + iy: —8 < y < 0}, respectively to the 
strip {t+ iy: 0 < y < B} where they are continuous and bounded, including 
the boundaries where they satisfy the KMS conditions (5.179). When the 
Gibbs state (5.177) is a density matrix, pg € Bt (H), these properties which 
are almost obvious in the case of finite-level systems, can be extended to 
systems with an infinite dimensional Hilbert space H [107, 300]. 


Examples 5.6.2. 


1. Spin 1/2: The density matrix in Example 5.5.5 corresponds to Gibbs 
states with Hamiltonian H = wo 3 and temperature 87t such that 


1 l1-—s 0 — 1 ewes 
P=3 0 1+sjJ  2coshßw , 
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2. 
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with s = tanh Gw. Therefore, the modular group consists of 


At — p” @ pt = e ttBwos & ett bwas — eT itBw(73@l2—12@03) : 


Fermions: Let N Fermionic modes, described by creation and annihi- 
lation operators at, i = 1,2,...,N, satisfying the CAR {a;, al} =0 


be equipped with a Hamiltonian operator 


N 
= X j 
H= Ei a; Qi. 
i=1 


This can be regarded as the second quantization of an N-level, one- 
particle Hamiltonian h = un e;|i)(i| € My(C) and a! as the creation 
operator of a Fermion in the eigenstate |i) with energy £;i. 

The partition function Zg is easily calculated since for each mode the 
occupation number states are |0) and |1) (see Example 5.4.1): 


N 1 N 
Z= Tr(e72#) =[[ 5 eem = nie + oF) 


i=1n;=0 i=1 


ajo 


whence the Gibbs state of N non-interacting Fermions read 
A; zi 
Pe = ute + es) eo PF. 
i=1 


In thermodynamics, Gibbs states are canonical equilibrium states, p5, 
while gran-canonical states have the form 


N 
-1 
3° =J] (ite Ble: a) e78 (H-U N) | 
i=1 


where u is the chemical potential and Ñ= D al a; is the number 
operator. Two-point expectations read 
ToS al aj) = 643 =—— , (5.182) 
? efi + z 


where z = e”? is the so called fugacity. As regards higher order correlation 
functions, by means of the CCR anyone of them can be reduced to sums 
of expectations with equal numbers of annihilation and creation operators 
matching in pairs: 


N 
Tr(pg al, . -al aji t aja) = öpaDet ( [TH al a;,)| g a) . (5.183) 


By suitably shifting the one-particle Hamiltonian, one may always assume 
the lowest eigenvalue (ground state energy) to be 0, whence 
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0 < Tr(o§° a} a1) = EE <1, 


for the CAR imply |jata|| < 1, so that z > 0. 


. Bosons: Let af, i = 1,2,..., N, satisfy the CCR [a;, al] = dj; of a 


system of N Bosonic modes with a second-quantized Hamiltonian 


N 
H=Ņ sial ai, E&>0. 


i=1 
The partition function reads 
N œ N =ï 
Zp = Tr(eP#) =| >> em], 
i=l nj=0 i=1 


and the canonical and gran-canonical equilibrium states have the form 


N N 
k> (1-2) eB H p§° =] (1-e Bei Ve B(H—BN) 


i=l i= 


The Bose two-point correlation functions read 


zZ 


while 2N-point ones are of the form 


N 
Tr(pg al, vee al, aji aj) = öpaPer( [re al, aj,)] ‘ a) , (5.185) 


where, unlike for Fermions, 2N-point correlation functions do not assign 
different signs to different permutations whence a permanent appears 
instead of a determinant. Furthermore, with the ground state energy set 
equal to 0, 


z 
Tr(o$° at a1) ee =0<2<1. 
The fact that when z — 1, the ground level can be infinitely popu- 
lated is the source of the phenomenon of Bose-Einstein condensation. 
Canonical and gran-canonical N Bose states as p¢° are Gaussian (see 
Example 5.5.2); indeed, the characteristic functions (5.116) of p5 equals 


N 
Tho W(2)) = [] (1°) Tõe eleeri) 


i= 


In order to compute the trace, it is convenient to split W (z) as in (5.77), 
and to use the overcomplete basis of coherent states (5.108) expressed as 
in (5.105); this yields 
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E P F TE ESNE, S, _le.|?2 sie Baal asics al 
Tr(e Be; a; Gi Q%i di “iat =e |z| /qr(e**e Peiliai a) 


—le.J2 dw; ie Beas. Ar 
=g [zil ay (wi | e7itie BELG; Gig zya; 
T 


= eiae f R: eo lil? (vac | ett? ebenajn (zi wia] vac) . 
T 
Taking into account that (see (5.100)), for any a € C, 

i t ak 

eve al ag g-aea ‘=X > H, [H, RIRN [H, alj] = ea 
E 
k=0 , 
k times 

ecca agt e aeata = er al 


and that (5.76) yields 
etera — ga7/2gaatyal _ gay gyal gaa , a€C, 


one obtains, by Gaussian integration, 


J= aa leak? (vac | et) tie Bea} aig (z: w;)al 
T 


E E 


L fii lwi? liwt] w): 
T 1 — e Bei 


Therefore, one derives the form of the correlation matrix (5.122) from 
N 
7 1 2 pEi 
Tr (p8 W(z)) = exp(—5 2 |zi| coth ) 


1 coth ph 0 z* 
= es ee a 
exp ( 1> z*) ( ü i an) (7.)) , (5.186) 


where h = 3 cili)(i| (£1 = 0) has been used. 

4. The relation (5.87) in Example 5.4.2, allows to equip the quantized hy- 
perbolic automorphisms of the torus T? with the Oy-invariant state wy 
defined by 


wy (Wy(n)) := L Tr(Wy(n)) = (5.187) 


on the Weyl operators and extended by linearity to their linear span, 
where it amounts to the normalized trace. 


5.6.1 Quantum Operations 


A major departure from classical mechanics is represented by the role played 
in quantum mechanics by the measurement processes where a microscopic 
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system, S$, on which the measurement is performed, interacts with a (usually) 
macroscopic system, FE, the measuring apparatus. 

In a classical, commutative context it is always possible, at least in line 
of principle, to make negligible the effects on S due to its interaction with EF; 
instead, in quantum mechanics states are generically unavoidably perturbed 
when undergoing a measurement process. The standard way the quantum 
mechanical perturbations are taken into account is via the so-called wave- 
packet reduction postulate; in its simplest formulation it goes as follows. Let 
X = Xİ be an observable with discrete, finite and non-degenerate spectrum, 
say X = 2a xj Pi, P; := |Y; )( y; |. Upon measuring X on a system 
S, the outcomes are the eigenvalues xj; the measurement process can be 
schematized as follows: a beam of copies of a same system S, all prepared 
so as to be described by a same state p, are sent through an apparatus that 
measures the eigenvalues x; leaving the system state in the corresponding 
eigenprojections P; and direct them towards a screen with d slits. By opening 
the jth slit, the others being kept closed, only those systems on which the 
eigenvalue x; has been measured are collected. Suppose N; of the N systems 
that interacted with the apparatus reach the screen through the j-th slit; 
then, the ratio N;/N approximates the quantity 


p) = Tr(p Pj) = (W lels) 


when N becomes sufficiently large. If no selection is operated, that is if all the 
d slits are left open, after sufficiently many repetitions of the experiment with 
the same state preparation, the collected mixture of systems is described by 
the projections P; weighted with the corresponding mean values pi . Thus, a 
typical non-selective measurement process changes the state as follows: 


d d a 
po SRP = Alel PoP. (6.188) 
j=l j=l j=1 
Fp[p] 


The map Fp is linear on the state-space S(S) and transforms states into 
states: indeed, Fp[p] > 0 and Tr(Fp[p] = Tr(p) = 1, as one can check by using 
the cyclicity of the trace and the fact that the P;’s constitute a resolution of 
the identity, }), Pj = 1. 

In general, the instantaneous change from p into Fp[p| transforms pure 
states into mixtures and may intuitively be associated with the loss of in- 
formation due to the interaction with the many degrees of freedom of the 
macroscopic measuring apparatus. As a consequence, contrary to classical 
mechanics, quantum mechanics distinguishes between two state-changes, a 
reversible one due to the Liouville time-evolution and an irreversible one, the 
wave-packet reduction, describing the action of measurement processes. 
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Remark 5.6.2. Effectively, while being subjected to a measurement, any 
quantum micro-system is to be considered as an open quantum system dy- 
namically and statistically correlated with the (infinitely) many, degrees of 
freedom of the measure instrument. This many-body interaction is usually 
not controllable and the phenomenological description of its overall effects 
is via maps as in (5.188). In particular, a measurement process of an ob- 
servable on a system whose state |) is a coherent superposition of the 
observable eigenstates, |y} = yai c;|w;), transforms it into a mixture 


p= ~ |c;|?| Y; )(; |, with consequent loss of coherence. 

The existence of two basic quantum time-evolutions, one reversible typical 
of closed quantum systems, the other one irreversible and related to measure- 
ment processes, is unsatisfactory from an epistemological point of view. All 
the more so, since the irreversible macroscopic behavior of system plus appa- 
ratus should be deducible from the reversible dynamics of their constituent 
microsystems. Alongside with the problem of reconciling thermodynamical 
irreversibility with microscopic reversibility, quantum mechanics raises the 
question of how to reconcile a reversible microscopic dynamics which pre- 
serves the purity of states with an irreversible macroscopic one which trans- 
forms pure states into mixtures. A number of approaches have been developed 
to attack this problem, for a thorough review of one of them which is based 
on a modification of microscopic dynamics by the insertion of a decoherent 
mechanism with negligible effects on microsystems, but substantial ones on 
macrosystems see [21]. 


It is convenient to extend the notion of wave-packet reduction to that of 
positive operator-valued measures (POVM). 
The key property of a map as in (5.188) is its structure and the use on 
oe af \)= 
the right and left of p of operators such that D P; = iS Pi) = 1. The 


generalization is quite natural. 


Definition 5.6.1 (Partitions of Unity). Let E; € B(H), j € J, be a 
selection of operators such that Yes HIB; = I: it is usually referred to 


as a POVM or a partition of unity. One associates to it the linear map 
Fe : S(S) + S(S), 


Feld = X0 E; p F}. (5.189) 
jEJ 


In Example 5.6.4, we shall discuss the interpretation of generic POVMs in 
relation to measurement processes; for the moment, it suffices to stress that 
the operators forming POVMs need neither be self-adjoint nor orthogonal 
projections. While the von Neumann entropy is constant under the Liouville 
time-evolution; on the contrary, under a generic POVM, it can increase or 
decrease. 
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Example 5.6.3. If p = Q is a pure state (one-dimensional projection), then 
S(Q) = 0, while under the action of a wave-packet reduction, Fe[Q] gets 
mixed and S(Fe¢[Q]) > 0. However, if one starts with a mixed p, S(Fe[p]) can 
be smaller than S(p): take for instance E = |1)(0|, E2 = |1)(1], where 
|0), |1) are a basis in C2, then, EÍ E, + EŽE = 1 and, for all p € S(S), 


Felo] =|1)(O|l0)(1] + [1)(1 lol 1)(1] = [1)(1] Trp) = | 1) (11 - 
Thus, for any given mixed p, S(p) > S(Fe[p]) = 0. 


As we shall see in Section 6.1, the use of generic POVMs , that is not made 
of orthogonal projections, is practically useful when one wants to distinguish 
between non-orthogonal quantum states. However, consider the statement 


measuring the (orthonormal) POVM P := {Pj}j;e7 € B(H) on the system 
S in the state p corresponds to the irreversible map p +> Fp[p] = 2 jeJ P;pP;. 


This has an acceptable interpretation in physical terms for the orthogo- 
nality of the P;’s reduces the experimental measure of P to an experiment 
with #(J) slits. The same argument does not directly work when projec- 
tive POVMs P are substituted with generic £ := {Ej}jes, Djey HE) Bj = 1. 
Consider the statement 


JET 


measuring a generic POVM € := {E;}jez € B(H) on the system S in the 
E;pE}. 


state p corresponds to the irreversible map p > Fefp] = } -jes 

In order to give it a meaning, one has to specify what is measured and 
on which system; indeed, the non-orthogonality of the £;’s makes untenable 
the straightforward interpretation accorded to projective measurements. An 
answer to the above question is given in terms of couplings to ancillas and 
partial tracing. 


Example 5.6.4. [224] Let E := {E;}jez C B(H) bea POVM for a system 5S. 
Let R be an auxiliary system described by a Hilbert space K which provides 
an abstract quantum description of an instrument to which S is coupled 
during a measurement. A schematic description of a measurement process 
associated with € is as follows: 


1. there exist orthonormal bases, {| ~;)} € H and {|k)}x>0 € K, with | 0) 
corresponding to the ready-state of the measurement apparatus; 

2. there is a unitary time-evolution operator U, on H & K such that, at the 
end of the process, at time t = T say, for any initial y € H, one has 


Ur|b)@|0) => Bj|v) @|5) =: |F). 


jed 


The unitary operator Ur is well-defined: indeed, the right hand side of 
the above equality can be taken as a definition of Ur as a linear operator 
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from H & |0) into H & K. Since jes PE; = lls, where Is denotes the 
identity operator on H, scalar products of vectors in the subspace H ® |0) 
are preserved and the isometry Ur can be extended to a unitary operator 
on H & K. Let the compound system S + R be in the state W, according 
to the postulate of wave-packet reduction, by measuring the eigenprojectors 
ls 9 Pr, Pk := |k) k|, k € J, the outcoming (not-normalized) states 


1s 8 P, |Y) (Y |ls © Pe = (Eel) (VIB) @ Pr, 
are obtained with probabilities 
Tmely) = (P | ls @ Pe |W) = (4 | E} Er |Y) . 


By disregarding the ”instrument” R, the overall effect of the entire process 
on the system S alone is as follows: 


— by measuring the projections Ils @ P, on S+ R after the action of Ur on 
|Y) @|0), the state |% } changes into the normalized states 


~. _ _Exlv)(olEt 


| be): , 
(Y| ELE, I) 
with probabilities 7;(w); 


— without selection, the overall effect is 


ID) (bl > So ash) |b; ) (45 | = 2 Bly) (od Bl = Fely. 


JEJ jEJ 


— The process described by (5.189) is obtained by linear extension of the 
action of Fe from projectors to mixtures of projectors. 


Remark 5.6.3. The previous one is an example of dilation of a CP map to 
a unitary evolution on a larger system from which the former is obtained by 
partial tracing [109]. In general, any POVM € := {Ej}je7 C B(H) can be 
dilated to a projective POVM P = {P;}je7 consisting of orthogonal projec- 
tors P; on a larger Hilbert space K [143]. For POVMs such that card(J) = d, 
the proof goes as follows: consider the Hilbert space Ke = H @ C4 linearly 
spanned by vectors of the form 


d 
[ye => Ejlvy) Sli), 
j=l 


where | Y; ) € H and {|j)}¢_, is an ONB in the auxiliary Hilbert space C*. 


Let |) € Hand set |Wy)e := ae E;|@) |j). The operators P; on Ke 
defined by 
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Pil) = Ejip) 813) 
are orthogonal projections such that De P; = 1 on Kg and the projective 
POVM P = {P;}4_, C B(Ke) is such that 


d d 
SPU ee(Uel Py = X Elo ol BL SliNi] 


j=l j=l 


whence (5.189) results by tracing over the auxiliary Hilbert pace C?. 


5.6.2 Open Quantum Dynamics 


Despite the practical impossibility of describing the interaction between 
a micro-system and a macro-system during a measurement process, it is 
not without hope to try a dynamical derivation of the wave-packet reduc- 
tion (5.189). The idea is that the latter is a time asymptotic effect of a 
many-body interaction whose time-scale is much shorter than the duration 
of the process. The phenomenological description of the process cannot be 
given in terms of an automorphism U: on one hand, U is reversible, while 
the wave-packet reduction is not, on the other hand U; cannot transform pure 
into mixed states. 

A straightforward way to extend the quantum time-evolution beyond the 
reversible one generated by the unitary Liouville equation is to add some 
extra structure to (5.168). One observes that the commutator corresponds to 
a linear action on the state-space S(S), and that the generated dynamical 
maps U; satisfy the composition law U; o Us = U;+5 for all s,t € R. 

A sensible step is to modify (5.168) by adding to the commutator a linear 
term that breaks time-reversibility and generates a semi-group I+, t > 0, of 
linear maps obeying a forward-in-time composition law Jo Is = I.4;, where 
now s,t > 0. Namely, one tries to substitute (5.168) with a time-evolution 
equation of the form 


dolt) = Lulo(t)] + DIe) - (5.190) 


Formally, the semi-group of linear maps {I;}:>0, solutions of (5.190), is ob- 
tained by exponentiating the generator: 


p= pelt) = Rl], eae’, L= Ly +D. (5.191) 
Not all linear maps D lead to physically consistent irreversible time-evolutions 
I; the following conditions result necessary: 


1. Tr(D[p]) = 0: since Tr([H, p:]) = 0, this implies trace-conservation 
O,Tr(pr) = 0; 
2. D{p|* = D[p]: this guarantees preservation of hermiticity; 
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3. the positivity I|] must be preserved at all times t > 0. 


While the first condition can be relaxed, for instance in the case of de- 
caying systems [11], which we shall not consider, the other two conditions 
are instead necessary to ensure that I, map density matrices into density 
matrices. However, positivity-preservation alone does not suffice for the full 
physical consistency of I}, the stronger property of complete positivity dis- 
cussed in Section 5.2.2 turns out to be necessary. Despite its mathematical 
origin, this notion is deeply rooted in quantum physics. Its importance was 
firstly appreciated in the theory of open quantum systems [11, 96, 181]. 

Equations of the form (5.190) that lead to semi-groups of dynamical maps 
that break time-reversibility are usually derived when one thinks of S as a 
subsystem immersed in a large (infinite) reservoir, or heat bath, R. Practi- 
cally, one deals with a situation similar to the one in Example 5.6.4 and uses 
a partial tracing technique: the system S is not closed, but coupled to a large 
system R. The system S+ R is described by the tensor product Hilbert space 
H ® K, its states pg;p are density matrices on such a space and evolve in 
time according to the unitary time-evolution 


UstR 


Ps+R > ps+r(t) =UP""[ps+r] = Us+r(t) Pps+r U} p(t), 


generated through (5.168) by a Hamiltonian of the form 
H= Hs Het hop; (5.192) 


where Hs r are the Hamiltonian operators describing the reversible time- 
evolutions of system and reservoir alone, while H; takes into account their 
interactions with À an adimensional coupling constant. 

The interaction Hamiltonian is such that there are practically no hopes 
to arrive at an explicit unitary time-evolution Ug;pr(t). On the other hand, 
one is interested in the dynamics of the open system S alone. Furthermore, in 
many situations of physical interest, one may reasonably assume that there 
are no statistical correlations between system and reservoir at t = 0; namely, 
the initial state of the compound system can be taken of the factorized form 
Ps+R = PsSpr Of the initial condition. Then, by tracing over the environment 
degrees of freedom, one obtains a one-parameter family of dynamical maps 


ps => ps(t) := Trx(ps+r(t)) (5.193) 


on the state-space of S which is called reduced dynamics. 

Together with the fixed form of the initial condition, the elimination of the 
environment degrees of freedom by means of the partial trace Trg makes the 
evolution irreversible. The factorized initial condition does get entangled in 
the course of time, so that, in general, the family {p5(t) }:>0 of states satisfies 
a highly complicated integro-differential evolution equation of the form 


drps(t) = | ds Dls- s), (5.194) 
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in which the linear operator LÀ on the state space S(S) of the system exhibits 
memory effects that account for the entanglement of the system with the 
reservoir from time t = 0 to time t > 0. Before dealing with how one can 
eliminate the memory effects and get an evolution equation that generate 
a semi-group of maps on S(S), it is convenient to examine (5.193) in some 
more detail. 

Let for sake of simplicity assume that the initial state of the reservoir 
is described by a density matrix be pp = 2; pil PEYR |, where p; > 0, 
5 jPj = 1, and the pr form an orthonormal basis in K. We use them to 
calculate the partial trace Trg: 


R 
=X p (UF User) YF) ps (dF |US, Rl) YF) - 
ay 
Notice that the matrix elements provide operators 


Valt) = yp (Y? |Usir(t) YF): HOH, 


so that the reduced dynamics corresponds to maps 


ps = Alps] : = 2 Vut ) Ps Vit). (5.195) 


According to Proposition 5.2.1, the resulting dynamical maps are CPU with 
the V;;(t) as Kraus operators. However, they do not form a semi-group be- 
cause of the memory effects built in the integro-differential equation they sat- 
isfy. Under the hypothesis of a very weak coupling between S and R (A << 1), 
a semi-group reduced dynamics is obtained by performing suitable Markov 
approximations, the most straightforward being the substitution 


A ds L3[ps(t — s)| > Lips(t)] = o i 2) ae 


Since the memory effects have been eliminated, L is a generator corresponding 
to a Liouville equation or master equation as in (5.191). 

In the so-called weak-coupling limit [105], the Markov approximation 
sketched above can be understood as follows: an expansion to second order 
in the small coupling constant À shows that (5.194) becomes 


t 
Orps(t) = —i[Hs + ” Hı , ps(t)] + xf ds D(s)[ps(t — s)| , 
0 
where D[-] acts linearly on the state space S(S). The effects due to the 


presence of the reservoir are thus visible on a time-scale T = tA? which is 
slow as À < 1; by rescaling the evolution equation reads 


TX 
d,ps(TA~*) = -i[A-7 Hs + Hı , ps(tdA~*)] + | ds D(s)[øs(Tà7? — s)] . 
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Then, by letting À — 0 one replaces the upper integration limit by +00 and 
neglects s in comparison with T\~? in the argument of the state appearing in 
the integral. The problem with too naive Markovian approximations as this 
one is that very rarely they lead to irreversible evolutions that are positivity 
preserving [105]: most derivations provide time-evolutions that are not posi- 
tive and generate physically inconsistent negative probabilities. For instance, 
the wild oscillations due to the system Hamiltonian term \~? Hs when A > 0 
makes intuitively plausible using an ergodic average to smooth away too fast 
effects [95]. 


Irreversible Dynamics within the Bloch Sphere 


With respect to Example 5.6.1.1, it proves convenient to represent density 
matrices p E€ M2(C) by Bloch vectors with one more component correspond- 
ing to the coefficient of oo in the expansion (5.104). 

We shall identify p as a 4-dimensional ket R* > | p) := (1, p1, p2, p3). As 
a consequence, the linear action of the generator L : p + L[p] in (5.191) 
corresponds to a 4 x 4 matrix £ = [L,,,] acting on |p). The Liouville equa- 
tion (5.168) thus becomes 


with —2H and —2D 4 x 4 matrices corresponding to the commutator Ly 
and the added term D in (5.190) (—2 has been inserted for convenience). 
Concerning the matrix D, the request of trace and hermiticity preservation 
imposes Do; = 0, j = 1,2,3, and D,, € R. By splitting D into the sum of a 
symmetric and antisymmetric matrix, the latter corresponds to a Hamilto- 
nian contribution that can be incorporated into H. Thus, one remains with 
a purely dissipative matrix 


0 0 0 0 
ua be 

D= FFEIT (5.196) 
w c y 


with 9 real parameters which depends on the phenomenology of the system- 
environment interaction and can, in line of principle, be tested in dedicated 
experiments [32]. 

By exponentiating £, one gets a one-parameter semi-group of 4 x 4 matri- 
ces, {G:}>0, such that G, = e~?!+?) which corresponds to the semi-group 
{Li }>0 on the state-space S(S) given by 


3 


pr plt) = Til => pult) ou, Pult) = (Gro)y - 
u=0 


Since the trace is preserved at all times, checking positivity preservation 
amounts to checking whether Det|p(t)] > 0 for all t > 0 and for all initial p. 
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Since the contributions of the anti-symmetric H cancel out, the time- 
derivative of the determinant reads 


3 3 
. dDet|p(t 
Djp] = eet et) o am (> Dis pip; + > Prop) 
= i,j=l j=l 
lə+n- o 
Let p be a pure state P(n) := = = 4 then Det[P(n)] = 0. Therefore, 


T,[P(n)] > 0 asks for D[P(n)] > 0 and the same must also be true for the 
orthogonal projector P(—n). By summing D[P(n)] > 0 and D[P(—n)] > 0 
and varying n in the unit sphere, positivity is preserved only if 


a be 
D®=[b a B|>O0. (5.197) 
c yJ 


The positivity of D@) is necessary for positivity preservation, but not suf- 

ficient, the reason being that D[P(n)| < 0 can follow because of the extra 

term Da Djopj. However, it becomes also sufficient when we ask that I; 

increase the von Neumann entropy of any initial state, as this is equivalent 

to u = v = w = 0 in D. Indeed, given any initial p, let it be spectralized as 
Lio | l-n-o 


PH fri 2 r T2 2 ’ 


with 0 < r12 < 1, n € R? and $7?_, n2 = 1. Then, one explicitly computes 
> j=1 j 


(3) omt a 
af(ri —ra)(n|D aot Pita 


S(p): 


t=0 


If Djo = 0 for j = 1, 2,3, then the time-derivative is positive because of the 
positivity of D and due to the fact that (rı — r2) ln r1/r2 > 0; if not, one can 
always find a p for which S(p) < 0: it suffices to choose r1—r2 sufficiently small 
and adjust n to make negative the second term in the previous expression. 


Example 5.6.5. Let us consider the following simple master equation for a 
2-level system S, 
Op(t) 1 


ae 5 Pe — 02 p02 + 03 p03 — P) . 


Using (5.56), L[1] = Lio] = Llo3] = 0, while L[o2] = —2 02; therefore, the 
generated semi-group y+ = exp(t L) is such that 


1 = 
yle] = 5 (a + p101 + e7” p202 + p303) ; 
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0 0 0 
Since the matrix in (5.197) is now D®) = | 0 1 0 J, the necessary con- 
0 0 0 


dition for positivity preservation is satisfied. Further, the Bloch vector at 
time t, p(t), is such that ||p(t)|| < |||]; therefore, any initial density matrix 
remains a density matrix. 

Suppose the 2-level S system evolving under y is statistically coupled to 
another 2-level system 9” that has no evolution of its own. Then, one has to 
consider states of the composite system S’+ S$ that evolve in time under the 
semi-group of maps of the form I; = idz ®j, that is I; lifts the action of yz 
from M2(C) to Mo(M2(C)) = M2(C) ® M2(C) as in Section 5.2.2. 

Among the possible initial conditions for T, there is the Bell state |Woo) 
in (5.164); we know from (5.17) that the corresponding projector P2 is pro- 
portional to the matrix E = [E;;] € M2(M2(C)) whose entries are matrix 
units in M2(C). By writing Ey, = (1+ 03)/2, Ei2 = (01 + io2)/2 and 
E22 = (1 — 03)/2 in terms of the Pauli matrices, it turns out that 


a 1 
P= (181+ 80 — 02902 + 03 8 03) ; 
Then, setting \; := exp(—2t), under the time-evolution I;, P2 evolves into 


a 1 
EP? = ;(18 1+ 01 @ 01 — A102 B 02 + 03 B08) 


2 0 0 1+ 
il 1+ 03 o1 + iApoa il 0 0 1— àt 0 

4 \oi— io 1-os ~ 4 0 b= Ay 0 0 j 
1 dy 0 0 2 


which is not positive definite for any t > 0, for it always shows a negative 
eigenvalue (A; — 1) /4. 


The physical meaning of the previous example is that, though yj; is a 
meaningful time-evolution for one 2-level system, I; is not so for two 2-level 
system as there exists a state of the two together which does not remain 
positive definite. Notice that the state which exposes the problem is entan- 
gled; indeed, any separable state, as in Definition 5.5.3 would remain positive 
under I: as yelp] > 0 for all p € S(S), 


T |X Auh 05 | =X ago @ vel] = 0 - 
ij ij 
The importance of Theorem 5.2.1 is now apparent: physical transformations 


of an N-level system S cannot be described by linear maps A that are only 
positivity preserving, they must also be completely positive. Otherwise, by 


5.6 Dynamics and State-Transformations 247 


coupling S with another N-level system S”, one would obtain a map idy @ A 
which would map the initial entangled state PY into a non-positive definite 
matrix. 

The standard quantum time-evolution U+ is automatically in Kraus form, 
thus completely positive and free from inconsistencies with respect to statis- 
tical couplings to ancillas. It is only when performing a Markovian approx- 
imation that one must check that complete positivity be guaranteed by the 
procedure [95, 96, 105, 285]. 


5.6.3 Quantum Dynamical Semigroups 


Positivity and complete positivity depend on the dissipative term D[p] added 
to the commutator in (5.190): it turns out that when one asks that the 
generated semi-group consist of completely positive maps, then the form of 
the generator is completely fixed. 


Theorem 5.6.1. [126] Let {y:}:>0 be a one-parameter semi-group of her- 
miticity preserving, unital linear maps y% : Ma(C) => Ma(C) such that 
limz+o0Y = ida with respect to the norm-topology. Then, 


1. the semi-group has the form 7, = exp(tL) with generator 


L[X] =i[H, x] È S Ca( FÌ X P, z L FIR, x}) 


a,b=1 


where the matrices F, form an ONB in M,(C) with respect to the Hilbert- 
Schmidt scalar product with Fp = la/vd (Tr(F.) = 0 if a # d? — 1)) 
and the (d? —1) x (d? — 1) matrix C := [Cab], called Kossakowski matriz, 
is Hermitian. 

2. The maps y are completely positive if and only if [Cab] is a positive 
matrix. 


Proof: From Example 5.2.7.1, the linear maps Fap : Ma(C) > Ma(C) 
defined by Fap[X] := Fİ X Fy, a,b = 1,2,...,d?, form an orthonormal basis 
in the d? dimensional linear space of all linear operators on Ma(C) equipped 
with the Hilbert-Schmidt scalar product of the associated Choi matrices. It 
follows that the generator L can be expanded as L = Ta Lab Fav. Then, 
the request that the generated semi-group preserve hermiticity implies that 
L[X]t = L[X") for all X € M,(C) which in turn yields L*, = Lya. Now, after 
rewriting 


d?-1 a? 
1 
L[X)=FX4+ XFt+ XO Las F’aga X Fy, F := z2 Loe F}, 
a,b=1 a=1 
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and separating F into its Hermitian components, F = K + 7H, where 


d? d2 
1 
K := g (Lae FÅ + LeaFa), H= Dlhe Pelni 
one concludes that 
d?-1 
LIX] =iH, X] + (KX+XK)+ SO Le FIX. 
a,b=1 


The first statement of the theorem follows by further imposing unitality, that 
is that 7,[1] = 1 for all t > 0. One thus gets that L{1] = 0, which further 
d?—1 
1 

i > E t 
imposes K = 5 2 Lap Fy Fo. 

According to Theorem 5.2.1, y is a CPU map on M,(C) if and only if 
I, := ida ® y is a positive, unital map on Ma(C) ® Ma(C). Notice that the 
maps I; form a norm-continuous semi-group with generator Liz := ida Q L; 
then, according to [177, 178] (see also [64]), the maps ida @ y+ are positive if 
and only if 

Iy, 6) = (Y| Lisl] ¢)(e|] |v) = 0 


for all orthogonal Y, ¢ € C? @ C4. Since (|) = 0, it follows that 


d?—1 


H= D> Cas (14149 F} A)| la 8 Fo Iw)) - 


a,b=1 


Then, it proves convenient to define the d? x d? matrices Y = [y;;] and = 
[¢;;] where w;; and Qij are the components of the vectors w and ¢ with respect 
to a fixed ONB {| i, j )}f j= in C? @ C’. Notice that (y|¢) = Tr(W'6). By 


introducing the vectors |v) € cr-1 with components given b 
g p g y 


= ($| 14 Q Fa |Y) = Tr(Fa(®'v)*) , 


d?’—1 
one then rewrites I (Y, ¢) = 5 Cab VZ w = (v |C |v). EC = [Cab] > 0, then 
a,b=1 
I(w, ¢) > 0 for all orthogonal Y, ¢ € C? @ C4, whence I; is positive. 
Vice versa, given a generic vector |v) € € C’-1, the traceless matrix 


Ma(C) > Y := a aoi Va Fa corresponds to a vector Y € C? @ C? that is or- 
thogonal to the non-normalized totally symmetric vector |Y) = SG 7" |i). 
If I; is positive, then (4, #4) = (v|C|v) > 0 for all |v) € CP-1 whence 
C> 0. 
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Remarks 5.6.4. 


1. If {y}:50 is a norm-continuous one-parameter semi-group of positive 
maps yı : Mqg(C) + M;(C) with generator L and Y, € Ct are orthogo- 
nal vectors, then 


O< (dl vullo) Olly) = t(v| Lie) ll lY) 


to first order in t. This yields the only if part of the theorem used in the 
previous proof; it turns out that this condition is also sufficient for the 
maps 7; to be positive [177, 178, 64]. 

2. The extension of Theorem 5.6.1 from Ma(C) to B(H) with H an infinite 
dimensional Hilbert space, has been provided by [193] under the assump- 
tion that || L[X]|| < ILII || X|| for all X € B(H), namely that the generator 
L be bounded on B(H). 

3. By duality, one gets the following time-evolution equation for the states 
of the open quantum system S: 


Lio] = ilH, p] + a Ca (Fp F} — L FIR, pt) 


a,b=1 


Since the y, are unital, their dual maps q; preserve the trace of p. 
d?-1 
4. If y% is CP , the expression 5 Cab Fo pFt can be put in Kraus form 
a,b+1 
as in (5.36). Such a term corresponds to what in the classical Brownian 
motion is the diffusive effect due to the presence of a white-noise. It is 
indeed sometimes called quantum noise which is also in agreement with 
the effects of generic POVMs on quantum states [121]. 
5. Beside the noise contribution, the remaining part of the generator has 


the form 
i i ae 
(H — -K)p + ip(H+ 4K), K=- Lab FS F, . 
i( > )e+ip( 7 ) 3 2o b Fa Fp 


This expression corresponds to the typical phenomenological description 
of the time-evolution of decaying systems; in particular, K is a damping 
term due to probability that goes irreversibly from the system S' to its 
decay products. 

6. Regarding the generated maps y+ = exp(tL), there are no general results 
on the form of the Kossakowski matrix C = [C;;] able to ensure that the 
yi be positive; the only available general expression is for d = 2 [178]. 


Semigroups consisting of completely positive maps are called quantum 
dynamical semi-groups. Their derivation as Markovian approximations of an 
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underlying reversible many-body dynamics mainly follows three schemes, the 
already mentioned weak-coupling limit, the singular-coupling limit [126, 125, 
230] and the low density limit [104]. All of them work when the time scales 
of the system S and of the reservoir R are clearly distinguishable. The weak- 
coupling limit is the one most frequently encountered in the literature since 
the beginning of the theory of open quantum systems and also the one which, 
if not performed with due accuracy [96], leads to semi-groups of maps which 
are not completely positive and thus to physical inconsistencies in relation to 
entanglement. 


Example 5.6.6. [27] Let d = 2; in such a case, by choosing the orthonormal 
basis of Pauli matrices F} = a;/ /2’s, j = 1,2,3, the dissipative contribution 
to the semi-group generator reads 


For sake of simplicity, we shall restrict to entropy-increasing semi-groups. 
Then, we can consider the matrix D in (5.196) whose entries read 


a = C22 + C33 , A= Cia + C33 , y = Cii + Caz 
b=—-Cyz2, c= —Ci3 , B= —Co3 . 


Thus, the positivity of [C;;], which, according to the previous theorem, is nec- 
essary and sufficient for the complete positivity of I, results in the necessary 
and sufficient inequalities for a, b,c, a, B, Y: 


2R=at+y-a>0, RS >b 
25S=a+y-a>0, RT>¢ 
IT=ata-—y>0, ST >Q 
RST > 2bceß + RB + Se +TV. 


These constraints are much stronger than those coming from positivity alone, 
that is from DC) > 0 in (5.197) which yields 


a>0,a>0,720,aa>b, ay>c’, ayz 8? 


and DetD®) > 0. As a concrete example, take a = a and 8 = b = c = 0; so 
that the Kossakowski-Lindblad generator reads 


2a — 
Lp] = Z (opo =p) + > (o2poz — p) + Y lospos —p), 
whence L[o1,2] = —a 01,2 and L[o3] = —yo3. It follows that, when a,y > 0, 


the generated semi-group y+ describes a decay process towards px. = 1/2 
with different rates for the diagonal and off-diagonal elements of [p]. 
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Indeed, setting ju := exp(—yt) and A; := exp(—at), it turns out that 


1+ pps ay 


Io] = 5 (Ut Mloo: + p202) + mos) = 5 
=r oO oO o3)=-7 : 
Yi lp. 2 tlP101 T P202 MtO3 2 \ r+(p1 + ipa) 1— pups 


On the other hand, consider T, = id2 ® yj and P? as in Example 5.6.5, it 
turns out that 


a 1 
nP] = (lettre (01 ® 01 — 02 Q 02) + p03 08) 
1+ pe 0 0 2At 
if © iep 0 0 
= 0 (6 im 6 
2At 0 0 1+” 


This matrix is positive definite if and only if 1+ yz > 2+. This is implied 
by the complete positivity condition y < 2a, whereas if y > 2a, when t — 0 
one gets 

1+ py — 24, ~ t(Qa—y) <0. 


In conclusion, only complete positivity guarantees full physical consistency 
with respect to statistical couplings with other systems. However, this im- 
poses a hierarchy, y < 2a, upon the decay rates of the entries of the dissipa- 
tively evolving state 7;[e], which should otherwise only be positive. 


5.6.4 Physical Operations and Positive Maps 


The argument behind the request of complete positivity on state transfor- 
mations is that one can never exclude that the system S undergoing the 
transformation is indeed entangled with an ancilla system S’, even without 
any effective sign of statistical correlations. Though plausible, this point of 
view is not always accepted [235]; after all, the mere possibility of entan- 
glement with an uncontrollable ancilla would then, via complete positivity, 
constrain the decay properties. Consider, for instance, an actual experiments 
where optically active molecules interact weakly with a heat bath; they can 
effectively be described as 2-level systems. The relaxation to equilibrium of 
their optical activity can accordingly be predicted by an appropriate master 
equation. Clearly, the fact that the optical activity may depend on whether 
the molecules are entangled with some other system out of any experimental 
control sounds admittedly weird [72, 185, 186, 292]. 

However, most of the objections to complete positivity do not consider 
the entanglement issue for they all focus upon single open quantum systems 
in heat baths. If, however, two optically active molecules in a same environ- 
ment are considered, the entanglement issue comes to the fore. If the two 
molecules do not interact between themselves, but are weakly coupled to 
their environment, it is sensible to describe their open dynamics by a semi- 
group of dynamical maps of the form I; = y% Q Y+, where y is the reduced 


252 5 Quantum Mechanics of Finite Degrees of Freedom 


dynamics of a single molecule. These dynamical maps differ from idz ® y+ in 
Examples 5.6.5 and 5.6.6. 

Notice that in going from ida ® y+ to I} = y% D Y+ one effectively goes 
from the possible existence of statistical correlations between the system Sq 
and another system of the same type which is somewhat uncontrollable, to a 
concrete scenario when one has two statistically coupled systems in a same 
environment. The following result on one hand extends Theorem 5.6.1 and 
on the other stresses once more the fact that complete positivity is not just a 
mathematical option without physical meaning, rather an unavoidable con- 
straint on all sensible Markovian approximations. 


Proposition 5.6.1. /37] Let {y:}i> be a norm-continuous semi-group of 
dynamical maps y+ : Ma(C) > Ma(C) with generator as in Theorem 5.6.1. 
Then, the linear maps I; = + y+ form a norm-continuous semi-group on 
Ma(C) ® Ma(C) and preserve positivity if and only if y+ is a CPU map for 
all t > 0. 


Proof: One implication is straightforward: if y is a CPU map, then 7;,@id2 
and id, © y are positive and such is I}. 

For the other implication, notice that, in view of the assumptions, the one- 
parameter family {I;}:>0 is a norm-continuous semi-group with generator 
Ing = L Qida +id4 8 L. Then we argue as in the proof of the second part of 
Theorem 5.6.1 and show that 


Iy, p) = (Y| Lillo) olly) = 0 
for all orthogonal 7), ¢ € C4 & C4. Since (y |¢ġ) = 0, it follows that 


d?-1 


1(,9) = Y Cas ((Y| FL @ 1416) 6| Fe Tal) 


a,b=1 
+(|la® Fi|o)(¢|t,@ Fly) « 
By means of the matrices Y = [pij] and ® = [¢,,;] associated to the vectors 


p, o E€ C? Q C? as explained in the proof of Theorem 5.6.1, one introduces 
the vectors |w), |v} € C’ —! with components given by 


w = ($| Fla |b) = Tr(F,(W")) , v = (4| USF |v) = T(F(O'D)*) , 


and rewrites 


d?-1 


KY, p) = J Car (wae + viv) =(w|Clw)+(vlClv) — (*) . 


a,b=1 
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Given |w) € C2-11, construct Ma(C) 3 W := DA Li Wa Fa. Since a matrix 
and its ao ae are always similar [134], let Y € Ma(C) be such that 
WT =YWY~—! and define 6 := Y—!, Wi := Y W so that 


yt =W, (Wtd)T=(YWY')?=W and |w)=|v), 


whence (*) becomes I(w,¢) = 2(w|C|w) > 0. Observe that |w) € C-t 
is generic and that to any such vector one can associate orthogonal vec- 
tors Y, € C? @ C4 through the matrices Y, ® as described above ĉ. Thus, 
I(w,¢) > 0 for any such pair implies C := [Cab] > 0. 


Remarks 5.6.5. 


1. If positive, T; = y+ 8 y+ is also CP; indeed, using (5.36), it turns out that 
W(X] =}; ™ X,V;(t), X € Ma(C). As a consequence, 


ee t) @V{(t) X V;(t) @V;i(T) . 


2. The equivalence between the complete positivity of y+ and the positivity 
of I; = y Q y does not extend to the tensor products of generic q2 : 
indeed, in Proposition 6.2.2 it will be shown that IT, = ny) Q P can be 


positive without nyt?) being both CPU maps. 


Once a semi-group reduced dynamics is accepted as a phenomenological 
time-evolution under certain physical conditions as those compatible with, 
for instance, the weak-coupling limit scenario, there is only one possible way 
to get rid of the complete positivity constraint. One has to rely upon the 
existence of physical mechanisms that eliminate those initial entangled states 
that, like the symmetric projector P3, would otherwise be cast out of the state 
of space by I; when 7; is not completely positive [122, 124, 292, 319]. 

In quantum information the situation is physically clearer and complete 
positivity compulsory. In fact, the state transformations that are commonly 
considered do not from dynamical semi-groups arising from suitable Marko- 
vian approximations, rather they are maps as in Definition 5.6.1. Indeed, the 
simplest operations are local state transformations that two parties operate 
on shared entangled states as Pe. In order to be physically consistent, these 
local operations must correspond to completely positive maps. 

What then of positive maps? If, on one hand, the existence of entangle- 
ment in nature forbids their use as dynamical maps that describe actually 
occurring physical processes, on the other hand, as we shall see in the follow- 
ing chapter, they are extremely useful as entanglement witnesses. 


°Notice that, as (Y| ¢} = 0, the matrices 6¥' and W'@ are traceless. 
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6 Quantum Information Theory 


In the last years, a considerable amount of theoretical and experimen- 
tal studies have been focussing on the impact that quantum mechanics 
may have on computer science, information theory and cryptography. We 
shall loosely refer to this vast and variegated field as quantum informa- 
tion [71, 48, 100, 128, 224, 242, 152, 239, 307]. In the following, we shall 
briefly touch upon a small fraction of its many achievements. 


6.1 Quantum Information Theory 


Why quantum information? Is it not classical information sufficiently power- 
ful a theory to satisfy our needs? The answer is that it will indeed be so until 
computational models and information transmission protocols are based on 
classical physics. Indeed, information is physical [187, 55] for it is carried by 
physical entities, transmitted and manipulated by physical means; as a con- 
sequence, any actual information processing protocol will rely upon a model 
describing the physical processes involved. Since Nature is considered to be 
ultimately quantal, one is inevitably led to consider a scenario in which quan- 
tum mechanics will set the rules of the game also in dealing with information 
and its manifold aspects. 

Roughly speaking, the issue at stake is the use of qubits instead of bits as 
fundamental informational resources so that one has the whole Bloch sphere 
of two-level system states at disposal instead of just the two states (up and 
down along the z-direction) that are available to classical spins. 

When the information that is manipulated regards computational pro- 
cesses, the question is whether Quantum Turing Machines (QTMs), that is 
computing devices based on the laws of quantum mechanics, might perform 
better than classical Turing machines. A breakthrough was indeed the dis- 
covery that relevant speedups can be gained by quantum algorithms because 
of the huge parallel computation made available by the possibility of linearly 
superposing qubits states. 

Truly, from an abstract perspective, as much as classical mechanics is 
contained in quantum mechanics, also classical information, computation and 
cryptography may be thought of as commutative versions of more general 
theories, still in their infancy, that are to be soundly formulated within a 
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quantum, non-commutative, framework. However, the need to elaborate these 
more general theories is not only justified in line of principle, but comes from 
concrete facts. The pace at which every two years electronic devices double 
their efficiency (the so-called Moore’s law) and decrease in size is such that 
non-classical effects will soon appear and quantum mechanics will become 
necessary to cope with them. 

If information is carried by qubits , then the possible reversible operations 
to which they can be subjected are all those corresponding to unitary matrices 
in M2(C): these are called quantum gates. In the classical case, the only non- 
trivial gate on bits 0,1 is that which flips them, 0+ 1, 1+ 0. Consider, for 
instance, the Hadamard transformation in (5.58); its n-fold tensor product 
UÏ” acting on |0)®” produces a uniform linear combination of kets labeled 


by the 2” binary strings i” € RP, in one stroke: 


n n 1 n 
u a = a Yo ee (6.1) 
Me Q™ 


Linearity is at the basis of quantum parallelism: suppose that the computation 


of a binary function f : a) = a on n bits with n-bit strings as images, 


i™ > (£(i™))™, can be operated by means of a unitary transformation 
Li) o Uli) = EEO) 


on n qubits. By Uy f is computed on all strings at once, as follows: 


n n 1 LTE n 
UUR 0)” = a KEE ™ y 
an (n) 

im EQS” 


The linear structure of quantum mechanics seems to provide a more powerful 
setting than the classical scenario; however, the extraction of information out 
of quantum states is a much more delicate problem than with binary strings. 

Any computation performed by a QTM on n qubits must correspond to a 
unitary operator on (C?)®”; then, a quantum algorithm acting on an initial 
state of the n qubits would halt in a linear combination of all possible compu- 
tational basis vectors in (C?)®”, each one of them corresponding to a classical 
n-bit string i occurring with a certain amplitude C(i™). If the solution 
of a problem is a specific binary string i), an efficient quantum computa- 
tion must associate to that string a very high probability, |C(é™)|? ~ 1. 
Only in this case the solution would show up with almost certainty from a 
measurement in the computational basis. 


Example 6.1.1 (Deutsch-Josza Algorithm). Let f : Re + {0,1} be 
a binary function that is known to be either constant or balanced, that is 
f(#™) = 0 on half of the n-digit strings and f(i™) = 1 on the other half. 
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The task is to decide between the two possibilities. Classically, the only way 
to ascertain whether f is constant or not is to compute it on 2"~!+1 strings, 
that is on half plus one of them; this is because one can compute always 0 
or always 1 on 2”/2 strings in a row without the function being constant, so 
that only one more computation can settle the question. On the other hand, 
if the bit strings i could indeed be treated as computational basis vectors 
|i) in the Hilbert space H™ = (C2)®” of n qubits, then the following 
quantum algorithm would answer the question in just one trial. It is based 
on a generalization of the CNOT gate of Example 5.5.9; instead of one control 
qubit, there are n of them all prepared in the same state |0) together with 
one target qubit in the state |1). As seen in (6.1), 


|) = Ug" |0)%" @|1) = (=) So a) ee 


MEQ” 
Th ix Mon(C) ® Ma(C) 3 U; = 5) 9 (4g | @ ofF™ | 
e matrix Mən (C) & M2(C) > Us Lyme lj I |8 oy is 
unitary and flips the last qubit only if f(j‘”) = 1. This yields ! 


1p) = uw) = () Sy fa) @ WEF woes )) 


MEQ” 


1 $ a” s(n 0)=]1 
= (=) SP (Fe) 4) a z 5 


ime 


Applying the Hadamard rotation on the first n qubits (see (5.58)), one gets 


z ji 1.4" KONORO 
IY) := US” @ IY) = (=) ( So gep Hamy )) D 
j™) 


ar) col 


o =I) 


V2 


where i™ . j™ := 7"_, ikje. Since projecting onto |0)®” yields 


14” s(n 0)—|1 
(z) E (e | joyer @ SAT, 


men” 


the amplitude of |0)®”" in |v) is 0 if f is balanced, +1 if f is constant. 
Therefore, after operating the circuit one has just to perform a measurement 
in the computational basis {| 4 dimen Of the first n qubits : if |0)®” 
occurs the binary function is constant, otherwise it is balanced. 


"The CNOT gate (5.165) corresponds to choosing f(i) = i, i = 0,1. 
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The preceding discussion regards instances of classical information being 
encoded into quantum states and manipulated by quantum gates. Similarly, 
classical information might be stored or transmitted by quantum means and 
the question is then how to retrieve it with high reliability or fidelity. Sending 
classical information encoded into non-orthogonal quantum states has indeed 
the advantage of being protected against undetected eavesdropping. 


Example 6.1.2 (No-Cloning). Suppose a sender A encodes the bits 0 and 
1 into the qubits | Yo), | Y1) € C? with | Yo) Æ | Y1) and sends them to a 
receiver B. If a spy E wants to access this amount of information without 
being spotted, he/she has to read the transmitted state without changing it, 
otherwise sender and receiver might get alerted. A way to do this is for the 
spy to intercept the message during transmission and to copy it by means of 
a unitary operator Ug acting as follows Ug(|w) ®@|e)) = |4) 8 |y). But 
unitarity implies 


(volti) = (Yo Belyi @e) = (yo @e|ULUE hh @e) = ((volvr)) , 


whence Wo,1, not being equal, must be orthogonal. Therefore, if the code 
states Wo; are chosen not to be orthogonal, the spy cannot copy them with- 
out alterations. This argument goes under the name of no-cloning theorem 
and asserts that there cannot exist a unitary operator U that implements the 
operation of copying two generic quantum states, unless they are orthogonal. 
Indeed, if such an unitary operator U existed, then, on the linear combina- 
tions of two orthogonal states | ¢),| ¢), 


U((alv) + B14))@le)) =al) Bly) + 14) @14) 


= (ald) + 814)) @(al¥) + 8,14) 
= lo? ly) 91V) +1371) 14) 
+ a8 (Ib) 81d) +1)81%)). 


This can only be true if either a = 0 or 8 = 0 as one can see by scalar 
multiplication by |Y) @|¢). 


In Section 5.5.4, it has already been emphasized the central role of entan- 
glement as a resource for quantum informational tasks. Among the applica- 
tions of entangled states to information transmission are the protocols for the 
so-called dense coding and teleportation. In the first case, 2 bits can be sent 
with one use of an entangled quantum channel, which points to the possibility 
of achieving higher channel capacities if channel behave quantum mechani- 
cally. In the second case, quantum states can be transferred between distant 
parties sharing an entangled state by means of local quantum operations and 
classical communication (known as LOCC operations). 
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Example 6.1.3 (Dense Coding). If sender A and receiver B share the 
entangled state (5.164), A can encode the pairs of bits (xy) into the Bell 
states of Example 5.5.9 by local operations performed on his qubit, only. 
Indeed, the states |,.) result from acting with the Pauli matrices on the 
first qubit of |oo); explicitly 


| Gory ) = (0307) & 1|%o) ’ LLY = 0, 1. 


Then, if A and B share |oo), in order to send B two bits (x,y) of classi- 
cal information, A acts on his qubit with ofo}/ and sends it to B. When 
both qubits are with him, B has them in the state | Poy ); by performing a 
measurement in the Bell basis, he can thus recover the pair (ay). Roughly 
speaking, one can transmit two bits at the price of 1 qubit, that is by just 
one use of the entangled quantum channel represented by |Woo) and its local 
modifications. 


Example 6.1.4 (Teleportation). Suppose A has two qubits, denoted by 
1,2, the first one in the state |W): = a|0); + 8|1)ı and the second one 
. . . $ 1 à 2 
being one party in the symmetric Bell state | Woo )23 = Fa Xoli) 8 li)s 
together with a third qubit (3) of B. Let B perform a Hadamard rotation on 

his qubit in |Woo), changing the entangled state into (see Figure 6.1) 


1 
1 
| Boz := (1 @ Up) Meo)as = > 214) 2 ® Uy|i) 


Iw) |®) 


Fig. 6.1. Teleportation 


The state | a), can now be teleported from A to B becoming |4% )3. The 
protocol is as follows; A performs on his two qubits 1,2 a measurement in 
the ONB {| wr )12}3.—o of C? & C?, where |W, )i2 = On & Up |Woo) 12 with 
(YW, |W) = ZTr(opov) = Suv. Notice that the amplitude of | Y, )12 in the 
state | W)1 ® |8 )23 is 


1 


(Ply @|®)as) = 5 So (lon lb) (él Ua) Urli) 


i,j=0 


260 6 Quantum Information Theory 


1 1 
= 5 Dlilould) Uli) = onl ¥)s 
i=0 


where it has been used that Ug = Ui, and U?, = 1. Thus, after A has 
classically (that is by means of a classical channel) communicated to B the 
result of his local measurement, B knows his qubit to be in the state o| Y }3, 
whence by a local rotation by a, he gets his third qubit in the state |% )s. 

The procedure does not violate no-cloning for the state that appears at 
B’s end, disappears from A’s end. Neither does it violate Einstein’s locality; 
indeed, before classical communication of the actually measured index u, B’s 
state is the equidistributed mixture of the four possibilities corresponding to 
the four different measurement outcomes of A; explicitly, using Example 5.2.5 
with the normalized Pauli matrices o,,/V/2 as ONB, 


= Tome von, = 


On the other hand, before A’s measurement, the marginal state of B is 


= Tri 2(|W)11( | @ | ©) 2393( ® |) 


= Tõi) 5 3 Te(li)z2( l) Unlé)ss(9 Un = 5 - 


t,j=0 


Notice that the net effect of quantum teleportation is to get the third 
qubit in the rotated state o |Y) by means of a measurement in the ONB 
{| %.)12}%—9 peformed on qubits 1 and 2, when the state of 1,2,3 is |~)1 @ 
| P )23. 


Example 6.1.5. In order to implement a two-qubit gate like the unitary 
Ucnor on two target qubit states %1,2, one adds to them three pairs of qubits 
each of which in the same entangled state |®) introduced in the previous 
example. Thus, one deals with a multipartite entangled state [249, 323, 71] 


|W) := | Y1) 8 | Y2)2 @|H)34 @|H)57 @ | P)es 


corresponding to the scheme in Figure 6.2. 

Then, measurements are performed on qubits (1,3,5) and (2,4,6) in the 
ONBs obtained from the GHZ vectors as in Example 5.5.10.3. By projecting 
|W) onto the 6 qubit state | ane )135 Q | Waef )46g, one computes 


1 


(105( Yate | ® 246(Paer |) IP) = (= a) D (lot la) (rlot li) (rl es) 


s=0 
a, j,k=0 
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Fig. 6.2. One way Quantum Computation 


x (s|of |¥2) (s|ofUn li) (s|03 |k) Unlj)7 @ Un|k)s 


Il 


1 5—1 
(<5) NO (rot lyr) (sof |b) 8 | ofUnof |r) Uno§|r)7 ® Unos|s)s 


r,s=0 
= (4) (Unos @ Unos) ue Gi & of) 1 )7 @| was, 


where in summing over i,j,k it has been used that, under transposition, 
O13 = 01,3. Thus, a part from local unitary rotations, by measuring in the 
chosen ONB one implements the unitary transformation 


UP = Yo (slofUnot|r)|r)(r| @/s)(s| 
r,s=0 
= J (sGe|Un|r ®b)|r)(r]@|s)(s| 


r,s=0 


on the state | Y1 )7 Q | Y2)s of the pair of qubits that remain unaffected by 
the measurement. In particular, choosing a = b = c = d = e = f = 0, it turns 
out that V2 U9 = |0)(0 [91+] 1)(1 [903 = 18Uy Ucnor L8Up, namely 
V2U9° amounts to the CNOT gate unitary matrix apart from unitary, local 
rotations. 


6.2 Bipartite Entanglement 


We have seen in Section 5.5.4 that, by looking at its marginal states, one 
knows whether a pure bipartite state is entangled or not. For density ma- 
trices entanglement detection is by far more difficult; only in low dimension 
the problem has been completely solved by the so-called Peres-Horodecki 
criterion [236, 148, 152]. 
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Proposition 6.2.1. Let a bipartite system Sı + S2 be described by the algebra 
Ma (C) ® Ma,(C), a state pig E S(S1 + S2) is entangled if and only there 
exists a positive map A: Ma, (C) +> Ma, (C) such that ida, ® A+[p12] is not 
positive definite, where A+ : B} (C2) + BY (C%) is the dual map of A from 


the space of states S(S2) = B} (C®) to the space of states S(S,) = BI (C%). 


Proof: The set Ssep(S1 + S2) of separable states over Ma, (C) ® Ma,(C) 
is the closure in trace-norm of the convex hull of pure separable states (see 
Remark 5.5.7). By the Hahn-Banach theorem [258], Ssep(S1 + S2) can be 
strictly separated from any entangled state pent by a hyperplane, that is 
by a continuous linear functional R : S(S; + S2) + R and a real constant 
a such that R(pent) < a < R(Psep). As the trace-norm and the Hilbert- 
Schmidt topology are equivalent in finite dimension, using the argument of 
Example 5.2.4, the action of R can be represented by means of R = Rt € 
Ma, (C) ® Ma, (C) such that R(p) = Tr(R p). Setting S := R’ — all, it thus 
follows that p € Ma,(C) & Ma,(C) is entangled if and only if there exists 
S € Ma,(C) ® Ma,(C) such that Tr(S p) < 0 while Tr(S psep) > 0 for all 
Psep € Ssep( S1 F S2). 

Furthermore, to any such matrix, the Jamiołkowski isomorphism (see Re- 
mark 5.2.5) associates a positive map Ag : Ma, (C) + Ma, (C) with S as Choi 
matrix. Let Af : BI (C%) + BI (C%) be its dual such that 


Tr(S p) =Tr (ida, & As[P?] p) = Tr (Pi ida, @ At il) i 


for all p € S(S1 + S2). If p is an entangled state such that Tr(S p) < 0, then 
ida, ® A$ [p] cannot be positive definite. Vice versa, if ida, ® A” [p] > 0 for 
all positive A: Ma, (C) +» Ma, (C), then p € Ssep(S1 + S2). 


As a consequence of the previous argument, a map A: Ma, (C) > Ma, (©) 
is a witness of the entanglement of the state p € S(S1 + S2) if ida, @ A* turns 
p into a non-positive matrix. Therefore, A cannot be a CP map; however it 
preserves positivity. Indeed, the Choi matrix L € Ma, (C) Ma, (C) associated 
to the dual map A* is block positive for Tr(L p) > 0 whenever p is separable, 
that is (Y| L|YI¢) for all y € C% and ¢ € C®, whence At is a positive 
map. 

Unfortunately, as already noticed (see Remark 5.2.6.3), unlike CP maps 
for which Proposition 5.2.1 holds, positive linear maps still lack a complete 
characterization. Consequently, given an entangled state p € S(S1 + S2) it 
is usually rather difficult to find a corresponding entanglement witness A. A 
relatively understood sub-class of positive maps is the following one. 


Definition 6.2.1 (Decomposable Maps). A map A : B(H) + B(K) is 
decomposable if it is positive and A = A, + Ago Tm, with A12 CP maps and 
Tu the transposition on B(H) with respect to a fixed orthonormal basis in H. 
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Let (di,d2) = (2,2), (2,3), (3,2), then a theorem of Woronowicz [321] 
asserts that all positive maps A : Ma, (C) —> Mg,(C) are decomposable. 
This fact makes transposition an exhaustive entanglement witness in low 
dimension; in other words, for the stated dimensions, those states that remain 


positive under partial transposition, are separable and viceversa. 


Corollary 6.2.1. If in the previous proposition (d1, dz) = (2, 2), (2,3), (3, 2), 
then, pig E S(S1 + S2) is entangled if and only if T” [p12] is not positive- 
definite, where 7?) := ida, ® Ta, denotes partial transposition on the second 
factor. 


Proof: If p € S(S; + S2) is separable then T? [o] > 0 for transposition is 
a positive map. Vice versa, because of the assumption, Woronowicz theorem 
ensures that any positive map is decomposable. Therefore, if TË [o] > 0, it 
turns out that, for all positive A: Ma, (C) —> Ma, (C) 


? 


ida, ® Alp] = ida, ® Aifa] + ida, ® 4[T® [p]] > 0, 


as 41,2 are CP maps. 


Remarks 6.2.1. 


1. Though partial transposition as transposition are to be defined with re- 
spect to a chosen ONB, the spectrum of an operator is basis-independent; 
therefore, the non-positivity of TË [P12], thence the entanglement of p12, 
does not depend on the ONB with respect to which the partial transpo- 
sition is performed. 

2. Those states which remain positive under partial transposition are called 
PPT states, otherwise NPT states, namely negative under partial trans- 
position. Woronowicz theorem does not extend to higher dimension; 
there are instances of non-decomposable positive maps already for dı = 
d2 = 3 [83, 130, 288, 321]; as a consequence partial transposition is 
not an exhaustive entanglement witness in higher dimension. In other 
words, all NPT states are entangled, but there can exist PPT entangled 
states [149, 297]. 

3. No pure bipartite state can be PPT entangled; indeed, by Proposi- 
tion 5.5.7, entangled vector states | Y12) € C% & C% have a Schmidt 


d 
decomposition (5.159) of the form |Yi2) = X Vly) @ |v), 
j=1 


where d := max{ d1, d2}, {| gA )}¢_, are orthonormal sets in the Hilbert 
spaces C®%:2 and the Schmidt coefficients A; > 0 for at least two indices. 


Set Piz := | i2 ) (Yio |; the partial transposition with respect to the ONB 
having {| yP” De among its elements yields 
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Riz = T? [Pa] = + PAPBPP]. 


i, j=1 


Let A12 > 0, then 


Rie 


PP) — |p py PYP) — | wP yy 
a = —y ` A2 5 . 


Thus T? [P2] cannot be positive if Pj is entangled. 

The entanglement of PPT entangled density matrices cannot be detected 
by decomposable positive maps as one can see from an argument similar 
to the one used in the proof of Corollary 6.2.1. An instance of such states 
will be discussed in Example 6.2.4. 


The following ones are families of bipartite states over C? @ C4, d > 2, 


where PPT states are always separable. 


Examples 6.2.1. 


l; 


Werner States [317] This is a class of d? x d? density matrices on 
C4 & C4 of the form pw = allg + 8V where V is the flip operator 
(see (5.32)) and W := Tr(pwV) (*). 

As the eigenvalues of V are +1, those of pw are a+ and must be positive. 
Also, V? = te and TV = Yj lii V li) = Daaa Cili) = d; 
thus, normalization and (*) yield ad? + 6d = 1 and W = ad + b@?, 
whence 


pg a ee E L E 

~ d(d? —1)’ d(d —1)? `°” d(d+1)’ ~ d(d—1 
d(d—- W) l  dW-—1 

= | <W<1. . 

Me pai P aea a Sa (6.2) 


If pw is separable as in (5.166), by spectralizing the contributing density 
matrices, it can always be recast as pw = D7, pijl i )(¥7 | @| 07 )( 47 |, 
Hij > 0, Ža Hij = 1. Then, 


W = Tr(owV) = X mlhi BUF |V di S47) = D ali | 45 D1 2 


ij 


This is a necessary condition for the separability of Werner states in 
dimension d; it is also sufficient; the clue is that Werner states are exactly 
those states on C? that commute with all unitaries of the form U & U 
with U a unitary matrix in M,(C). Practically, since V(A@B)V = B&A 
for all A, B € Ma(C), all Werner states have the form 
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pw = | av Uav)puleut), 
u 


where dU is the normalized, invariant Haar measure over the unitary 
group U on C? ?; furthermore, W = Tr(p V). 

If1>W >0, let p € C4, |¢) = VWI) + VI-W|y¢+) € C4, with 
(p| vt) =0 and set p:=|¢)(6|@|v)(W|. Thus, pw arises by twirling 
a separable state with tensor products of local unitaries, therefore it is 
itself separable, as local actions cannot create entanglement. 

The necessary and sufficient condition for separability, W > 0, turns out 
to be equivalent to pw being positive under partial transposition. Indeed, 
by applying TË = idg ® Tg to pw one gets 


d-W _dW-ils 


2 
T” lw] = TMD oe a1 Ft" 


Its eigenvalues (d — W)/(d? — d) > 0 and W/d are positive if and only if 
W > 0. 

2. Isotropic States [151] This is a class of d? x d? density matrices on 
C4 @ C? which are related to Werner states by partial transposition. 
They have the form pr = alaz + 8 Fe and are uniquely identified by the 
parameter 0 < F := Tr(pr P4) (*). 

Like for Werner states, positivity, normalization and (*) yield a > 0, 
ad?+ 8=1land1>F=a+{ > 0, whence isotropic states are mixtures 
of the totally depolarized state on C? and of the totally symmetric state, 


Ê- F) le  @F-1-, 
= | TOIR 6.3 
o= er e FeeL a 
Since (7 @¢@| P4 | @b) = (y| * )|2, where Y* is the vector in C? with 
complex conjugate components with respect to w, if pr is separable, then 
(see the previous example) 


eae 


F = Tr(prP?) = gm (di | (63)*)P? < 


As for Werner states, 0 < F < 1/d is necessary and also sufficient for 
separability. The reason is that isotropic states are all and only those 


?The particular convex combination of states (U & U) p (Ut @ U") appearing in 
the integral is known as twirling. Twirled p are such that, for all unitary V, 


VeV (| av eu put eu") vievi= f au VU &VU p(UV)' & (UV)! 
u u e i 


W 


= | aviwywewewiewt = | aveuputen', 
uU u 


for the Haar measure satisfies d((VU) = dU for all unitary V. 
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d? x d? density matrices which commute with local unitaries of the form 
U & U*, where U is any unitary matrix in Mq(C) and U* denotes its 
complex conjugate (not its adjoint). Moreover, one can show that, since 
(U® U*) Put & UT) = Pa, any isotropic pp arises from a twirling of 
the form 
pr = | aU (U80) oUt o0), 
u 


where UT denotes the transposition of U and p is such that Tr(pP?) =F. 
If Fd < 1, set |¢) = VdF |) +V1 — dF | y+) and choose p = | Y) |@ 
| @)( |; then, pr can be obtained by twirling a separable state and is thus 
itself separable. 
The above necessary and sufficient conditions for separability coincides 
with the isotropic states being positive under partial transposition. In- 
deed, , 
Geo. I=, PPL! 

Teds gay? age ay © 
has positive eigenvalues (dF + 1)/(d? + d) > 0 and (1 — dF)/(d? — d) if 
and only if 0 < F < 1/d. 


Distillability and Bound Entanglement 


Entangled states of two qubits as the Bell states (see Example 5.5.9) are called 
maximally entangled. Consider a pure state |W.) € C4 @ C4 of a bipartite 
system consisting of two copies of a same system; as we shall see, there 
are good reasons to measure the amount of entanglement of p12 by means 
of the von Neumann entropy of any of its two marginal density matrices 


pe?) = Tra, ( Da) I) (see Proposition 5.5.5). 


Definition 6.2.1 (Pure State Entanglement). Let | V12) E CQC? be a 
pure state of the bipartite system Sı + S2; the entanglement of | Viz) is 


Ep[| a2 (Yaa ll = 5 (09,2) - (6.4) 


According to Example 5.5.10.1, all Bell states have marginal states that 
are the tracial state with maximal von Neumann entropy: E[Êsy] = log 2. 
The presence of uncontrollable interactions with the environment in which 
a bipartite system may be immersed usually spoils its maximally entangled 
{Ol)—|10) . 
= ———— might 
Fa 
be rotated into |W) = a|01) + 6|10), with |a|? + |8|? = 1 and 0 < ja] < 1, 
so that 


states. For instance, the so-called singlet state |W)_ : 
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2 
e roe E[W] = —|a|? log |a]? — |81? 10g |8]? < log 2 . 


It may even be turned into a mixed state because the environment usually 
acts as a source of noise and dissipation. A measure of the entanglement of 
pı2 € S(S1 + S2) is as follows °. 


Definition 6.2.2 (Entanglement of Formation). /58] The entanglement 
of formation of a state pı2 of a bipartite system Sı + So is the least average 
pure state entanglement over all convex decompositions of p12, 


Ep[pial = min 4 S755 (0%) , pie = Alva (lal p > (63) 
J 


where àj > 0 and >), A; =1. 


Maximal entanglement is an important resource in quantum information, 
but also a highly degradable one; of particular importance are then those 
quantum protocols that enable to distil maximally entangled states out of 
non-maximally entangled ones by means of LOCC +. The basic scheme of 
a distillation protocol is as follows: given m copies of p12 € BI (C4 @ C4), 
one tries to maximize the number n of copies of the singlet state projection 
P_ := |W_)(W_| that can be obtained by means of local operations and 
classical communication: 


LOCC 
P12 8 P12 Q `: p12 P -QOP BPa 
——— > 1a 
m times n times 


The LOCC defining the distillation protocols amount to maps of the form 


m n 1 i 
per pl) = ‘Ne X Ai ® Ao; pf" Al; @ Al, , (6.6) 
iel 


where 
Nr = YO Te(Al;An ® A}, Asi Ta ) 
el 


while Aj; : (C¢)®™ = (C?)®", j = 1,2. 


3Various entanglement measures have appeared while quantum entanglement 
theory has been developing, for a review and the related literature see the contri- 
bution by M.B. Plenio and S.S. Virmani in [71]. 

“For a review of entanglement distillation and the other topics of this Section 
see [70] and the contributions by A. Sen, U. Sen, M.Lewenstein et al., W. Diir and 
H.-J. Briegel, and P. Horodecki in [71] 
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Practically, one seeks distillation protocols that output states p) whose 
distance from P®” (for instance, with respect to the trace-norm (5.21)) van- 
ishes when m — +00, while the ratio n/m is the highest possible. The optimal 
ratio, denoted by Ep[p12], is called entanglement of distillation and represents 
the maximal asymptotic fraction of singlets per p12 that one can achieve by 
LOCC. In other words, one can hope to distil at most n ~ m Ep[p12]]| singlets 
P_ out of m pı2 when m gets sufficiently large. 

It turns out that PPT states p12 cannot be distilled [150]; when p12 is sep- 
arable this is obvious since one cannot create non-local quantum correlations 
by means of local operations and classical communication. The interesting 
point is that one has to distinguish between free entanglement, the entan- 
glement which can be distilled, and bound entanglement, that which cannot 
be distilled. The result just quoted can be rephrased by saying that the en- 
tanglement of PPT entangled states is bound. This can be seen as follows: if 
the entanglement in p12 is distillable, then for some m, the state p in (6.6) 
must be an entangled state of 2 qubits. whence an NPT state according to 
Corollary 6.2.1. This implies that, for at least one index ig € I, the (non- 
normalized) state 
@m at 


lio 


@ Al (6.7) 


is NPT. Observe that, for such m and io, Aji, : (C4)®™ + C?; therefore, 
one can always write Ajig = ane |k)(w5x |, where | jx) € (C4)®™ and 
|k}, k = 0,1, is any chosen basis in C*. Let Q; be the projections onto the 
subspaces of (C7)®” spanned by | jo) and | #,1 ); then, 


P12 = Arig Q Arig Pio 


2io 


Di2 := Arig ® Azio Q1 @ QPR Qi ® Q2A Ta & Al, 


implies that py := Q1 ® Q2 pty" Qi ® Q2 must be entangled, otherwise its 
separability would be preserved when passing to p12. 

Consider now the orthonormal bases {| bjn )}@_, in (C4)®™, j = 1, 2, such 
that Q; = |bjı )(bjı | + | bj2)(b;2 |; in the corresponding representation p', 
is a 4 x 4 matrix acting on the subspace K spanned by the product states 
| bi;b2;), i,j = 1,2. Since it corresponds to an entangled state, by partial 
transposition with respect to the ONB {| ba; ee 1 1 2[p42] cannot be positive 
semi-definite. Therefore, there must exist |} € K such that 


2 


(D| Talia] |G) = SBE Gxo ( briba; | Tolia] |[bixbze) 
4,9; £=1 
= D DijBre (biibze | Piz |bikb2j ) = 3 Di; Pye (biibae | pir” |bixbay ) 
ijik l=1 ijik i=l 


= (8|TR"] |2) <0, 


where er ] is now the partial transposition with respect to the whole ONB 
{| b2; p . Also, the last equality follows because |®) is supported by the 
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subspace K corresponding to the orthogonal projection Q1 ® Q2 and thus has 
vanishing projections onto all | b1;b2; ) unless i, j = 1, 2. 

Since NPT is a property which does not depend on the basis chosen 
to compute the partial transposition (see Remark 6.2.1.1), fix the bases 
{|ej;x)}¢_, in C4 and choose in (C @ C¢)®”™ the product basis consisting of 
vectors | €1%, Elka «++ €1km, ) Q| €261 E263 ---€2¢,, )» Then, T[p%}”] = (T2[p12])®”; 
one thus concludes that p12 is distillable only if p12 is NPT. 


Remark 6.2.2. From Remark 6.2.1.3 we know that no pure PPT entangled 
state can exist; it turns out that their entanglement is always distillable and 
thus free. Whether the entanglement of generic NPT states is also free, that is 
whether all NPT states are distillable, is one of the open problems in quantum 
information theory [71, 152]. 


Entanglement Cost 


One of the first questions in quantum information has been whether, by 
means of LOCC one can turn a pure state |W.) of a bipartite system into 
another pure state |812). The answer is that this is possible if and only if 
the marginal states je 
Definition 5.5.1 [224]. 

If one considers asymptotic LOCC protocols where m copies of a state 
|W) are turned into n copies of a state |12) with vanishing error when 
m — +09, then the transformation of | W12 } into | 812 ) is possible if and only 
if [71] 


are more mixed than those of | ®12) in the sense of 


ng Er [V12] 
m` Er [S12] 


Since Ep[W_] = 1 (we shall use log, in the following), one can always asymp- 
totically distil n < mEp[W.2] copies of P_ out of m copies of any pure 
bipartite entangled state Wo. 

Furthermore, the reverse operation is also possible; namely, protocols have 
been devised which invert distillation and, by using m copies of the sin- 
glet state P_, form, by means of LOCC, n copies of a bipartite pure state 
Wi. Actually, like in the case of entanglement distillation, one considers the 
asymptotic minimal ratio m/n when n — +00 and p% is better and bet- 
ter approximated (within a suitable distance) by a suitable LOCC operation 
acting on P®™ [137]. The optimal asymptotic ratio, denoted by Ec[p12] is 
called the entanglement cost of p12; it represents the minimal fraction of sin- 
glet that is needed to create one bipartite system in the state p12. In other 
words, for large n, one can create n copies of p12 only acting with LOCC on 
no less than n Ec[p12] singlets. 

In [224] a distillation protocol Ap is constructed which asymptotically 


yields Ep[W2] = S (of?) singlets per bipartite entangled state p12 (see (6.4)) 
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and a formation protocol Ar that asymptotically yields one copy of p12 at 
the cost of Er[Y,2] singlets. It turns out that the entanglement cost and the 
entanglement of distillation equal the pure state entanglement of formation. 

By definition, Ec [io] < Ep [Via] < Ep|[“12]; if Ep [Vio] < Ep [%12]; then, 


by means of the protocol Ap one could asymptotically obtain 


copies 
Er [Y2] 


of YW. out of m copies of P_ and then, using an optimal distillation protocol, 
Ep [X12] 
Er [12] 
cannot increase the amount of entanglement by deterministic LOCC °. Anal- 
ogously, if Ec/Wi2] < Er[W2], then one could use an optimal creation protocol 


extract from them m > m copies of P_. This is impossible as one 


to obtain —" — copies of p12 out of m copies of P_ (for m large) and then 
Ec[Y12] 


use the distillation protocol Ap to extract from them m 


of P_. 

For pure states, forming entangled states from singlets and distilling sin- 
glets from entangled states are reversible operations; it is not so for mixed 
states and the reason for this peculiar kind of irreversibility is bound entan- 
glement [152, 150, 71]. 

Consider the entanglement of formation as defined by (6.5); it can be 
interpreted as the minimal averaged entanglement cost of p 2. In fact, given 


Er [V12] 


———— > m copies 
Ec[v2] p 


a convex decomposition of p12 = 7, Agl wi,)(wi, |, the entropies S (a ) 
12 
are the entanglement cost of the pure states that decompose it. However, in 
line of principle, it could be more advantageous to create the tensor product 
pis’ instead of the n copies of p12 one by one. One is thus led to define the 


so-called regularized entanglement of formation 
1 
EP = lim + Ep[p&]. 6.8 
F lia] = | lim y Erle] (6.8) 


Such a limit exists because the entanglement of formation is subadditive. 


Indeed, consider the state py @ pP 


S, + So E I Erle] is achieved at the (optimal) decompositions 


pÈ = = wa è Ha” 9; @) |. Since the decomposition 


of two copies of the bipartite system 


pi? @ p® = Dee LAP A ele a 


need not in general be optimal for Er [oS @ po), it follows that 


2 
Erft @ pQ] < Erlo] + Er[ol?] . 


°One can achieve entanglement increase by LOCC only probabilistically for 
certain states of a mixture, for instance in some of the states in (6.6), but not on 
the average for the whole mixture. 
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In [137] it is proved that the regularized entanglement of formation equals 
the entanglement cost: Ec[p12] = EF [p12]. Moreover (see P. Horodecki’s con- 
tribution in [71]), it has been proved that Ec[pi2] > 0 for all entangled p12. 


As a consequence of the fact that, if p12 is PPT entangled, no entangle- 
ment can be distilled from it, it thus turns out that a non-zero non-retrievable 
amount of entanglement (of singlets) is always necessary to create PPT en- 
tangled states. 


Concurrence 


We shall now elaborate a little bit more in detail on the entanglement of 
formation of two qubit states. 
Let S4,Bg be two qubits , |0) = l \1) = (2) the standard basis in 


C? and consider a generic two qubit vector state of S4 + Spg of the form 
|Wap) = Coo| 00 ) + C5, | 01) + Cio| 10) + Ci | 11) i 


Coo Cor 


Then, the marginal state p4 = CC', C = ( 
Cio Cu 


) has eigenvalues 


14+ /1=C@any 
5) ( AB) ; C(Wap) =2 \CooCi1 = Co1Cio| b (6.9) 
This expression can be recast as follows. Let |W% p) denote the complex con- 


jugate of |As ) with respect to the standard product basis {|ij)}; ;-9 , and 
denote 


| Yap) := 02@09|Vap) = —Cho|11)+C%,|10)+C%,|01)—CF,| 00) , (6.10) 


for 02 = (s a is such that o2|0) = i| 1), o2|1) = —i|0}. Then, 


(Pap | Yap) =C(Wap) . (6.11) 


* 


Since a & L (5) , it turns out that C(W4p2) = 0 when ¥4pg is separable, 


while C(W4g) reaches its maximum C(W4g) = 1 when Yag is maximally 
entangled and (only) two coefficients Ci; are proportional to 271/2, 
Therefore, for two qubit vector states, the entanglement of formation reads 


Ep|| Yap ) (Yas |] = €(Waz) := H2 (| tvi a) ; (6.12) 


where Ha(x) := —z log z — (1 — z) log(1 — 2). 
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The variational problem embodied in Definition 6.2.2 is in general ex- 
tremely difficult to solve and a general closed expression of E(p) as a function 
of p has been found only in the case of two qubits ; it is based upon the notion 
of concurrence [320]. 

Given a two qubit density matrix p E€ M4(C), one first constructs the 
density matrix 

Pp = 02 I P*O Q 02 , (6.13) 
obtained via the operation (6.10), where p* denotes complex conjugation with 
respect to the the standard basis {|ij)}; ;-91- Then, the quantity C(Waz) 
in (6.9) generalizes to density matrices as follows. 


Definition 6.2.3 (Concurrence). Let \;, i = 1,2,3,4, be the positive 
eigenvalues of \/,/pp,/p in decreasing order. The concurrence of p is 
C(p) := max{Ay — à2 — A3 — Aa, 0} . (6.14) 


Examples 6.2.2. 
1. Pure states Let p = |w)(wI, |) € C*. Then, 
2 
| 


vob = W| OA, 


whence C(p) = |(4lġ)|. 
2. Werner states Setting d = 2 in (6.2) 


, | 
oyo = =F, oa 

1 — 03 01 —109 
Da5, mo H, 


it follows that 


À 1 
P? = (1801+01 801 -02802 +03 803) 


1 
V =id @T[P2] = 5 (181+ 01 801+ 02902 +0980) 
1 ow = 
=-(1@14 
ee j(1@ 3 


Since W € R, the algebraic relations among the Pauli matrices yield 


Pw = pw, whence the eigenvalues of ,/,/pwpw./pw are those of pw, 


namely +4 (thrice degenerate) and +5”. It then follows that 


1 
(01 @ 01 + 02 8 02 + 03. 03) ) R 


max{—W, 0} -1 <W <1/2 


C(pw) = ; 
max{(W — 2)/3,0} 1/2<W<1 


whence C (pw) > 0 and pw is entangled if and only if W < 0, in agreement 
with Example 6.2.1.1 
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3. Isotropic States Setting d = 2 in (6.3) and arguing as in the previous 
example, the isotropic states read 


1 


AF —1 
pr =z (1@1+ 


(01 8 01 — 02 B02 + 03 B08)) a 


Again, it turns out that pr = pr so that the eigenvalues of ,/,/prpr,/pr 


are those of pp itself, namely LE thrice degenerate and F. Thus, 


max{2F —1,0} 1/4<F<1 


C(pr)= 
max{—(1+2F)/3,0} 0<F<1/4 


whence pp is entangled if and only if F > 1/2, in agreement with Exam- 
ple 6.2.1.2. 


By direct inspection, the function (see (6.12)) 


E(C(W)) = He (= a ) (6.15) 


is monotonically increasing (E'(x) > 0, 0 < x < 1) and convex (E"(x) > 0, 
0 <a <1) in the concurrence. As 0 < C(w) < 1, the entanglement increases 
from €(C(w)) = 0 for separable vector states to E(C(w)) = 1 for maximally 
entangled states and 


E(Axı + (1 — A)z2) < AE (x1) + (1 — AJE(z2), OS AK1,0< a2 <1. 
Further, given a decomposition p = a pil Yi yy; |, let 
(C)p=d, pilpil = X Cly) (6.16) 
J 


(E) p=5 ; py hba) sl = DP E(C(;)) (6.17) 


denote the corresponding average concurrence, respectively the average en- 
tanglement. Because of convexity, it turns out that 


E (Ca < (E) =5; pi lvy wsl - (6.18) 


Theorem 6.2.1. /320] The entanglement of formation (6.2.2) of any state p 
of a two qubit system is given by Er[p] = E(C(p)) and is thus a monotonically 
increasing function of the concurrence (6.14). 
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Proof: The right hand side of (6.18) is the argument of the minimum 
n (6.5) (see (6.12)), whereas the left hand side is an increasing function 
of its argument. It thus follows that Ep[p] cannot be smaller than €(Cmin) 
where Cmin is the smallest average concurrence. Therefore, if Cmin is attained 
at a suitable decomposition, then the same decomposition yields Ep|p] = 
E(Cmin). We will construct a density matrix p = $`; pj| V3) (Y; | such that 
(C) =5 , pj Lvs) (y; | = C(p) and show that no smaller average concurrence can 
be achieved, namely Cmin = C(p). 

In order to arrive at such decomposition, we first consider the expansion 
p= >, |v) (vi |, n < 4 being the rank of p, and |v;) its (non-normalized) 
eigenvectors such that (v;|v;) = rjôĝij, with rj the eigenvalues of p. 

The n x n matrix T with entries Tij := (v;|0;), where |0;) := 02 Q o|v;), 
is symmetric, (v,;|0;) = (v;|%;), but not hermitian and 


nm 


(TT Jij = (7 7*)ig = X (vil) rlu) = (ril Vo lr) - 


k=1 
Thus the eigenvalues of r7* are the squares of the eigenvalues A; of \/pp,/p 


in decreasing order (see (6.14)). Let Z be the n x n unitary matrix that 
diagonalizes TT*, 


Zrr* ZÝ = (ZTZT\(ZtTZT)* = diag(à?, AŽ, A2,A2), 


then Z can be chosen such that ZrZT = diag(à1, Az, A3, A4) is diagonal with 
the \;’s as eigenvalues. Setting |wi) := 05, Z7,|v;) gives p = X; |wi) (wil, 
with decomposers such that 


(w;|W;) = 3 ZikZ je (Up|0e) = (2727) 5 = = Vidi; . 
kjl=1 


Case 1: Ay < A2 +A3 + Aq. 


Because of the ordering of the ;’s, this case is possible if n > 3. Con- 
sider the quantity f(@) := vt, Axe: since f(0, 1/2, 2/2, 7/2) < 0 while 
f(0,0,0,0) > 0, by continuity f(~) = 0 at some y. Using the vectors | w; ) 
introduced above, let 


P 1 1 1 1 
ns th a 
la) =) Cues), C= Cas i 7 7 1] > 
1 


-1 -1 1 
where e?’4 and | z4) do not appear if Aq = 0. 


Introducing the normalized vectors | 4%; ) := | z;)/||z;||, from CTC = 1 it 
turns out that 
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: ~. _ IAC) 
p= Si lal? lbs Ms], and [abel Bel = Grogs =O 


i=l 


for alli = 1,2,3,4. Then, the vectors |; ) and thus p are separable. 
Case 2: Ay > Ag +A3 + Ag. 

Set |y1) =|w1), ys) = ilw), if j > 2. Then p = Ð; | yz) (yy |; fur- 
ther, consider the diagonal matrix Y = diag(A1, —A2, —A3, —A4) with entries 
Yij = (yi | 95 )- 

Because of Example 5.5.4, any other decomposition p = $5}; |z; )( 2; | 
is such that | z;) = 07, Vji|yi), with V a unitary matrix on C”. Therefore, 
for orthogonal V, the quantity 

Corts lee D 3 VjiVjkY;ij 

j=1 j;i,k=1 
= Tr(VYVT) =Tr(¥) = dy — Ag — Ag— Ag = C (p) 
is independent of V. By using this invariance property, one can find a decom- 
position p = )7"_, | 23 )( zj | such that 


(315) =C) = 15)1= lac (25) 


for all 1 < j < n. Thus, its average concurrence (6.16) equals C(p), 
(c)= lsc (FE) = Dela) =c0) 
j=l J j=l 

The | z;) are constructed as follows: unless all the Y;; are already equal 
to C(p), there must be one decomposer, yı say, with Yj; > C(p), and an- 
other one, y2, with Yo2 < C(p). Choosing V that exchanges yı with y2 
and leaves the other decomposers fixed, we obtain a decomposition with 
the same average concurrence and Yj1, Y22 exchanged. By continuity there 
must exist an Se matrix V such that (21|21) = (z2|Z2) = C(p), with 
lzi) = VL, Vjilvs). Iteration of this argument for the remaining decomposers 
yields the result. 

The proof of the theorem is then concluded by showing that no decom- 
position can achieve a smaller average concurrence than C(p). Indeed, using 
again Example 5.5.4, a generic decomposition has average concurrence 


Q 
(C) p32, | 24){ zal => (Za | 2q)| N= 


q=1 


n 


SA (Za) Ya 


i=l 


2 


where Z : C” +> C® is an isometry: ee (Za | = 1 for all 1 <i<n. 
Since one can always adjust the phases of the Z4; in such a way that (Z,1)? = 
|(Zq1)?| > 0, for all 1 < q < Q, then 
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Q n 
DO (Za) Yu 


(C) 2558 lza)(zal 2 


q=1 i=1 q=1 i=2 
Q n 
> Mi = [A D aA 2 Clo) - 
q=1 i=2 
Indeed, by assumption, 

Q n n Q 
NOYO (ZPA < YOYO | (Zi)? | bi = A2 + As +4 < Ar. 
q=1 i=2 i=2 q=1 


Two-Mode Gaussian States 


Let S be a bipartite continuous variable system consisting of two subsystems 


A and B described by annihilation and creation operators a, Cs lee eer 
and bi, i=1,2,...,¢,pt+q=f, satisfying the CCR (5.92), and arranged, 


as in (5.95), into a vector 
X = (a,b, al bt), a = (a1,...,ap) , b := (b1,...,0q) - 


We know that a state of S' described by a density matrix p is specified by the 
characteristic function (5.117) which now reads 

FY(z)=Tr (pe?"*) =Tr (pe%A 2 e758) (6.19) 
where A := (a,a'), B := (b,b'), Zap := (Zab, =z} p) with za = 
(Za1;2a2;+++;Zap) and Zp := (21, 22,---, 2bq)- Let Ta denote the transpo- 
sition with respect to the orthonormal basis of the occupation number states 
| ka) =|katka2..-kap), kai € N, of the subsystem A (see (5.93)); then, using 
the number state basis {| kake )}x,,4,, one calculates 


Tr (p™ e72) = D (hak |0™ ado) Jado le7* 4 @ e7? kako) 


ka.ky 
Jado 


= Y (gako| plRads)(dale2*4 |ka) (jele? P |e) 


ka.ky 
Jado 


= So (ako | pliads) (Ra le2* lja) (jale P Iks) 


ka ky 
Ja:db 


= Tr (p24 @ eB) , 


where A := (a',a). The last equality easily follows from 
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P 
n. E -a;—z*.al 
(kea 07 lja) = [I fas [07t sie 


i=l 


Fai) 


and 
za—z*al |. -| —zatz*at * -) —z*atzat 
(kle li) = ((jle*47* © |k))* = (j |677 t |k). 


Partial transposition thus amounts to changing annihilation operators of a 
chosen subsystem into creation operators within the Weyl operator appearing 
into the characteristic function of a bipartite state. 

In terms of position and momentum operators this means keeping fixed 
Ga = (a + at) / V2, qp = (b+ b')/V2 and D, = (b — bt) /(iv/2 while changing 
Pa = (a — at) /(iv/2) into —p,. This observation identifies partial transposi- 
tion as a local mirror reflection [278]. Also in the continuous variable case, 
separable bipartite states must remain positive, hence well-defined states, 
under partial transposition. Then, if the correlation matrix associated with 
p?” fails to satisfy (5.113) the state p is surely entangled. In view of the fact 
that positivity under partial transposition fails to be equivalent to separabil- 
ity already for two 3-level systems, one may suspect this to be the case for 
all continuous variable systems as well. Surprisingly it turns out that par- 
tial transposition is an exhaustive entanglement witness also for two-mode 
Gaussian states [278] (see also [103, 102, 198]). 

We shall use the notation of Examples 5.5.3 and start by noting that one 
can always consider Gaussian states p with characteristic function 


G(R) = e7 IR(®ı VIR) 


as in (5.131). Indeed, as local operations that do not alter the entanglement 
properties, the displacement operators D(u) = D(u1) ® D(u2) can be used 
to set the mean values Tr(p(q,p)) = 0. Partial transposition on the first 
mode amounts to replacing pı with —p; in V thus Js with —J3 in (5.138) 
(see (5.131)— (5.135)). Thus p and p?* are well-defined states if and only if 
both the following inequalities hold 


fee fy Ade Oy (6.20) 


4e/ Se Ty SOF (6.21) 


Ble BR 


Also, the operations leading from a generic V to the standard form Vo 
in (5.137) are local ones, acting independently on the two subsystems A 
and B; this means that if a two-mode Gaussian state p with correlation ma- 
trix V is separable the same is true of the two-mode Gaussian state pọ with 
correlation matrix Vo. In [278] it is showed that 


Lemma 6.2.1. All two-mode Gaussian states with Iz > 0 are separable. 
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Notice that, when I3 > 0, (6.20) implies (6.21) whence p7? > 0 in agree- 
ment with its being separable. Suppose instead that a two-mode Gaussian 
state p with Iz < 0 be PPT , then its mirror reflected p’ has Is > 0 and is 
thus separable by the Lemma, whence by a second mirror reflection also p is 
separable. 


Proof of Lemma 6.2.1 The strategy of the proof is to show that if a two- 
mode Gaussian state p has a correlation matrix V with Iz > 0, then V > 14/2 
whence, because of Example 5.5.3.3, p is separable. Because of the possibility 
of reducing V to the standard form 


a 0 y% 0 

0 a 0% 
Yo = ; 
° n 0 8 0 

0x 0 £6 


by local operations, one can equivalently show that [3 = y%19%2 > 0 implies 
Vo > 1/2. Analogously, since matrices of the form O(x) = diag(x,2~+), 
0 Æ x € R, implement local scalings of positions and momenta which preserve 


the symplectic matrix J = > : , one can focus upon 
O(y)O(z) 0 y, (PHEW 0 = 
0 O(y)O(a-") } 0 O(a~*)O(y) 
a(xy)? 0 yy? 0 
= 0 a(ay)~? 0 yoy yl 
vy 0 Ba*y? 0 = 
0 yoy? 0 pry? 


Consider the 2 x 2 matrices 


ar? y ) ar? y 
x= + YS 
( y pr? y pe? 


and notice that, according to (5.132)— (5.135), their entries are correlations in- 
volving position (qi,2), respectively momentum operators (p1,2). Their eigen- 
values are 


1 
C= zlar + ba? + /4y + (az? — ba)?) 
1 
T4 i= 5 (a2 + ba? + 4724 (ar? — ba?)?) 
ae ia ; Y4 
with eigenvectors | x+ ) = we! respectively | yz) = ie such that 
ti C1 2 9) 2 _2\2,-2\ 
zi = = = (ax* — brf) 4 1/4 + (ax? — ba-?)2c; ) 
2 
Y+ C2 = = = _9\~ 
afk = re = 2( (ax? — br*)\c,' + y4 + (az~? — bz?)?cz 7) 
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Since | x4 ), respectively | y+ ) are the rows of the orthogonal rotation matrices 
Ox and Oy which diagonalize X, respectively Y, one can make Ox = Oy = 0 
by choosing the scaling parameter x such that 


‘YL. Y2 


ax? —ba-2  ax™?-— br?’ 


ayı + by2 
ay2 + by 
X and Y of Vj is obtained by means of a same orthogonal rotation, the 


namely z? = . Since the diagonalization of the two sub-matrices 


overall transformation is symplectic (that is it preserves 2 = ( - ) J); 
2 do 


Therefore, one can study the diagonal matrix 


Y? £4 0 0 0 
y" := 0 y~” Y+ 0 0 
9 0 0 y?r 0 , 
0 0 0 y?y- 
which must satisfy VY + £2 > 0 whence x,y; > 1/4 and r_y_ > 1/4. By 
choosing the remaining scaling parameter y such that y?x_ = y~? y_ one 


gets that all four eigenvalues are > 1/2 and thus that VJ > 14/2. This means 
that the two-mode Gaussian state pj corresponding to such a correlation 
matrix has a P-representation (5.111) with a positive phase-space function 
Rő (rý) and is thus separable according to Example 5.5.3.3. Observe that this 
fact does not allows one to directly infer that also the state pọ with correlation 
matrix Vj is separable; indeed, the diagonalization of Vj has been obtained 
by non-local rotations involving both sub-systems. However, using (5.141) 
in Remark 5.5.3, the positive phase-space distribution Rg (rg) is obtained 
from the function R6(15) relative to the P-representation of p’ by means of 
a symplectic matrix S composed of a same rotation O in the q1,2 and p1,2 
planes, so that ||r|| = || ST r} ||. It thus follows that Rf (ri) = Ri(S~'rh) > 0, 
whence po and thus p are separable. 
If I3 = 0, let y1 > %2 = 0 and choose x? = a/b, y? = 2Vaß so that 


2a? 0 Wwivab 0 
yf oo 12 0 0 
oO | QnJ/a8 0 20? 0 
0 0 0 41/2 
j 1 
Similarly as before, one checks that Vj > +50 Y= i 


Positive Maps and Semigroups 


We have seen in Section 6.2 that positive but not completely positive maps 
cannot be directly used as mathematical descriptions of fully consistent state- 
transformations; however, they play a major role as entanglement witnesses 
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(see Proposition 6.2.1). Unfortunately, since there are no general rules that 
allows one to identify positive maps, only particular instances of them can 
be provided [179, 84, 85, 86, 114, 67]. The one which follows assumes the 
existence of just one negative eigenvalue in decompositions as in (5.41) that 
is smaller in absolute value than all the other ones [35]. 


Proposition 6.2.1. Let {L,}¢_, be a Hilbert-Schmidt ONB in Ma(C) and 
A: Ma(C) — M,(C) a positive map with a decomposition 


d2 
AX] =S° 4, L,XL,, X €M(C©), 
k=1 
where 0 < Li < bind fori > 2 while l = —|e,| < 0, with |&l < by. If 
1—||Z,||? 
\e1| < bo |Z , then A is positive. 


|Z]? 


Proof: The matrices Ly, form a Hilbert-Schmidt ONB, thus, using (5.30), 
it turns out that, for all normalized y, ¢ € C4, 


d? d 


Elertia = Svt eNe) = (WTNA) = 1. 


k=1 k=1 


Then, since ||L;||? = IL$? < Tr(LİL;) = 1 (see Remark 5.2.4), it follows 
that 


| 2 2 


= [ea [Itl 


g a 
(| (Za rlo) tolza) DESDI 
k=2 


k=1 


> fo(1 — |L?) — alll? . 


P 2 
= ts — (2+ lea) |rt] 
1- |Z|? 


I|L1 ||? 
malized 7, € C4, whence A is positive. 


If 4] < 2 , then it follows that (¢|A[|@)(@|])~ = 0 for all nor- 


In Remark 5.6.4.6 it has been stressed that, apart from the fact that 
the Kossakowski matrix C = [C;,;] cannot be positive, there are no general 
prescriptions on C such that the corresponding semigroup surely consist of 
positive, but not CP maps; as well as for positive maps, one can however seek 
sufficient conditions. 

In the following, we shall consider a system consisting of two d-level 
systems Sq and construct [35] a semigroup of positive, but not CP maps 


T; = y} @y7 on Ma(C) ® Ma(C), where a) = exp(tLı) is a semigroup of CP 
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maps, while ny?) is a semigroup of positive, but not CP maps ê. The construc- 
tion will also provide non-decomposable positive maps able to witness bound 
entangled states within a particular class of bipartite states with d = 4 [36]. 
We shall consider generators as in Proposition 5.6.1 without asking for the 
positivity of the Kossakowski matrix. 


Proposition 6.2.2. Suppose nh?) : Ma(C) + Ma(C) to be semigroups with 
generators 


LIX] =i[H® , x 1+ Sa "(er CO o EER, 


fori = 1,2, where Gg? = (G® )t € Ma(C), together with GË = 1/vd, form 
two Hilbert-Schmidt ONBs in Ma(C). 

Assume cf) > 0, 2=1,2,...,d?—1, and c£ = —|c?)| < 0, for one index 
k, while ci) >0 forL#Ak. Then, the semigroups of maps T; = a) ay o ) 
Ma(C) ® Ma(C) preserves positivity if of) > lP], €=1,2,...,d?-—1 and 


e > j], €=1,2,...,d2-1, L£ k. 
Proof: According to [177, 178] (see also Al in order to show that the 


semigroup {I;}i>0, with generator L = L®) & ida + idg ® L®), consists of 
positive maps, it is sufficient to prove that 


Iy, p) = (P| Ll] ¢)( el] |v) 2 0 
for all orthogonal 7), ¢ € C? @ C4. Since (|) = 0, it follows that 


Pi y ga > 
Tend) = E P eP oro] + X P komne? e) 
4 (= 


It proves convenient to define the following d? x d? matrices Y = [ij] and 
P = |ġpi;] where Y;j and ġij are the components of the vectors ~ and ¢ with 
respect to a fixed ONB {| ij} in C4 & C?. Then, one rewrites 


- d?-1 

-E4 GPa] + D P meP man] 

c= l=1 

= = 2 
= Ff - e) [res Devt] + E P [eP wta) 

l=1 kAl=1 


2 2 
TrGPov)| — |Tr(GP(wia))| 


d?-1 

2 
+ |e?) | S> 
=1 


°If the two semigroups 7 were the same, then, according to Proposition 5.6.1 
and the successive remark, I, positive would mean I, CP. 


(1,2) 
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As (y|) = 0, the matrices Pt and W'@ are traceless; using the ONBs 
consisting of the matrices GY = ry, €=1,2,...,d? — 1, one thus gets 


d?-1 


Y (D(a ov) -x((Sr4 Tr(G ow") JGP owt ) 


= 
=n (n'y? = Tr(wt6)? = Tr((wie)7)? 


aot 
=> (T(GP (wa)? 
f=1 
This yields 


d?-1 
2 
+ > meP wey , 


kAl=1 


2 
T(G (Swt) | 


d?-1 
TGP wian < S 
f=1 


whence one concludes 


yp) > YP = |e) T(P ar] 


g{=1 
d?-1 3 
2 2 2 
+ (i = |e) Ta, wia))| si 
ł=1 


Example 6.2.3. [35] Let d = 2 and og, a = 0, 1,2,3, be the Pauli matrices 
plus the 2 x 2 identity matrix go. Let Sa : Mə (C) +» M2(C) be the completely 
positive map X > Sa|X] = oa X oq, and set 


3 3 
1 1 
Xe 7 2 Sal X 7 2 €aSalX] , 


where €a = 1 when a Æ 2, whereas €2 = —1. The first map amounts to 
the trace map Tro (see (5.30)), while the second one corresponds to the 
transposition Tə with respect to the basis of eigenvectors of a3: indeed, it 
changes 02 into —o2 and leaves all other Pauli matrices unchanged. According 
to Proposition 6.2.1, it is positive but not CP , for Aag = diag(1,1,—1, 1) 
and ||oa]|? = 1. 

Consider generators Lı as in Proposition 6.2.2 with d = 2, F; = 0;/ V2 
and choose as Kossakowski matrices 


1 0 0 1 0 0 
c®=|0 1 0l, c%=10 -1 0 
0 0 1 0 0 1 
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Then, the corresponding master equations read 


1 3 
3(X Sila - 3) 


3 
aP = Lld = 5 (Sse Sil = o) 


D 
Ta 
D 

l| 

E 
= 
D 

Il 


The second one has been considered in Example 5.6.5 and generates a positive 
semigroup such that 


1 e72t = 
a [o] = 5(1 + pigit+e t b909 ar p303) =p F ~z 0202 : 
Since Ly[o;] = —20; while L [a9] = 0, the solutions of the first master equa- 


tion are the following CPU maps, 


1 gaat. 1—e-2t 
Og = OP = tee. 


Let idn, Tr, and T, denote identity, trace and transposition operations on 
(C?)®"; since 1 = Tr(p) and p — T[p] = p202, one rewrites 


e72 


of) = eid + l-e 7 2 1+e7 1— e7% 
WD = = 


5 idə + 7 To. 


t 
2 Tro ’ Vt 


As T4 = Tz 9 T2 and Tr2 o Tz = Tre, the tensor product maps T; = y} ® 77 
can be recast in the form 


_p,l+e77¢ 1—e4 


t 
I =e 5) id4 + 1 Tre ® idg 


No SS 


1—e! 1—e?! 
m — 2 (eT, Bid? + — — Tr ® ide ) oT4. (6.22) 


N: m 


T2? 


The semigroup {I;}:>0 consists of positive maps because the chosen Kos- 
sakowski matrices satisfy the sufficient condition of Proposition 6.2.2; more- 
over, the maps I; are of the form T, = T} + T? o Ty. It turns out that T} 
is completely positive for all t > 0 for it is the sum of tensor products of 
completely positive maps. If T? were also CP, each map T, would then be 
decomposable; in order to check whether this is true or not, consider the Choi 
matrix id, ® A;[P#], where 


Ps e2t 
Ay = eT, & idə + ~z Te Q idə . 
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Fixing a basis {|0),|1)} € C? and writing |84) = 4 7} ,_,|ab) @| ab), one 
explicitly computes 


id, @ 4 [P{] = 
(1+e-%) Pp? 0 0 0 
0 (Le P) 2e- P? 0 
0 2e- P? (le Pe 0 
0 0 0 (1 + e7?) P2 


This 16 x 16 matrix has eigenvalue 0 with eigenvectors 
(Ê; ),0,0,0) , (0,1Ê;),0,0) , (0,0,|% ),0) , (0,0,0,1 Ê;)) , j = 2,3,4, 
where |v; ) are the Bell states orthogonal to |Woo), while 
(|'¥oo),0,0,0) , (0,0,0,|%oo)) , (0, |Poo), Zoo) , 0) 


are eigenvectors relative to the positive eigenvalue (1 +e7%)/v2. More inter- 


3¢ i" ` 
esting is the last eigenvalue — with eigenvector (0, |Yo), —|Woo), 0): it 


is positive only if t > t* = (log 3)/2. It follows that I, is surely decomposable 
for t= 0 (Io = idi¢) and for t = ~: 


In order to ascertain whether the positive maps I; constructed in the 
previous example are not decomposable for 0 < t < t*, we need some further 
insight. Indeed, the decomposition (6.22) need not be unique and there might 
be other decompositions revealing that I, is decomposable for all t > 0. In 
order to proceed, we use the following result. 


Lemma 6.2.2. [35] Let A : Ma, (C) — Ma,(C) be a positive map and p a 

PPT state of a bipartite system Sı + S2. If Tr (ida, ® APS] p) < 0, the 

state p is bound-entangled and A not decomposable. 

Proof: If A is decomposable, so is its dual AT : Ma, (C) ++ Ma, (C); indeed, 
A= A +420 Ta SA sA + Ta o AF = AT + Âz 0 Ta, 3 


where AT is CP for it is the dual of a CP map, while Ay := Ta, o AZ o Ta, is 
also CP as the corresponding Choi matrix is positive. In fact, 


dy d2 
ida, ® AJE] Ea 5 ER 8 (Ta, 2 Ag) [EF] = Tad, © by EF @ Ag [EF] 
ij=l ijl 


= Taide ° (ida, & AT)[E”] 20, 
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where Taaa = Td,°T a, is the transposition on Mq,a,(C) = Ma, (C)®Ma,(C). 
It preserves the positivity of the Choi matrix idg, & A7[E®] associated with 
the CP map Aj. Since p is assumed to be PPT, if A is decomposable or p 
separable, then ida, & A? [p] > 0. 


To make good use of the above result, it is convenient to introduce a par- 
ticular class [34, 33] of 16-dimensional density matrices as states of a bipartite 
system consisting of two pairs of qubits; their structure is simple, yet flexible 
enough to represent an interesting setting where to test the decomposability 
of a wider range of positive maps A: M4(C) — M,(C). 


Example 6.2.4. Let S = Sı + S2 be a bipartite system where S1,2 are each 
a two qubit system; consider the sub-class of 16 x 16 density matrices con- 
structed by associating to the pairs of the set Lig := {(a, 3) }%, go the vectors 
Wap) := (l4 Q oag) |74), where |W4) is the Bell state | Woo) in (5.164) and 
Tag ‘= Ca ® Og are tensor products of Pauli matrices with og = 1. The 
vectors | Yag ) form an ONB in C’®, 


(Yo |Uys) = (PINs 8 oapoysl?t) = 7TR(o904)Te(0908) = Faqs - 
Given the corresponding orthogonal projections 
Pag = (Pap) Papl = (ida ® oap) Pt (ids ®@ ong) , Pag Pye = Ban Sse Pag ,» 
consider the states consisting of all oe convex combinations 
ae Ds Pap , 
T (a,@)el 


where J is a subset of Lie and Ny, its cardinality. 
The behavior of such states under partial transposition_can be deduced 
by means of the fact that the flip operator V = d id4 ® TalP4] (5.32) is such 


that Vti) = = \w2) and V(A 8 B)V = B® A for all A, B € Ma(C); while 


A® Bie) = Jabal @ Bli) -55i j\Alé)|5) Bli) 


4,9=1 
1 . eae : ~ 
= DL @ 37 Bli)(i| AT lj) = 148 BAT |B4) . 
j=l i=1 
Then, setting Pag := ida 8 T4[Pap] = il, ®oag V 148 cag, it turns out that 
~ 1 A 1 A 
Pagal Dys) = 74 8 Cap V la ® GagrysV V4) = Fans ® Capl¥+) 
eer ( \T 4) = 1, @ |24) 
= 7s 8 CaplTapaya) Wy) = EakyE pes la B Capo75Fa|W + 


1 
= grees] Dys) , 
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where it has been used that oF = €q0q With eg = 1 if a Æ 2, = —1 otherwise, 
that the algebra of the Pauli matrices implies 


1 1 1 1 


~ Z 2 1 1-1-1 
Oa0yOa = Nay Oy; N:S [an] =l ii 141 


1-1-1 1 
and it has been set 


b t= 1 


: 1 1 1-1 
Nay = EaEy Nay » n := [Nap] = —1 1 1 1 


1—1 1 1 


The vectors |W5) are thus eigenvectors of Pag with eigenvalues nayngs and 
pr := id, ® Talpr] = 5 (i 5 feri) Pag 
(7,5)€Li6 T Ca, p)EI 
Since the matrix Nay is symmetric, the eigenvalues of py can be recast as 
Tay DL "aes = (nX? n)ys 
T (a,pyet 
where X? is a sort of characteristic matrix of the sublattice 7 with entries 


x mw = IN, if (u,v) € I, = 0 otherwise. Concretely, consider 
I 


He OF © 
ell el = =) 
oro © 
oo Oo I= 
PROF 
=. OOO 


Thus, J is positive, hence p is PPT. 

We now show that Tr(id4 ® I;[Poo]e) < 0 for 0 < t < (log 3)/2, where T; 
is the positive semigroup of Example 6.2.3; by Lemma 6.2.2 it thus follows 
that p is PPT entangled and I; Da m that time-interval. 

3 


1 
Since Tr2(-) = 5 X Sy [-] and Tof- PO , the two semi- 
p=0 


groups in Example 6.2.3 can be recast in the joan. 


6.3 Relative Entropy 287 


3 3 
qa) 143A: l 1— A: B 3+ 1— A 
wec So + 7 2, Si, W = 7 So + 7 2, EGS; , 


where A; := e7”, so that, with Sagl: ] := Cag + Cag, 


(3454-35) G= FANS 
I= 6 Soo + 16 2 


1 2rz) = 
Tai 2y. a S es 


i,j=1 
nenne Ot apia a h F At) Dro 
i us ae =a aku ales 3 EjP;j. 
i, j=1 
It then turns out that 
0 < t< (log3)/2 => Tr(id4 Q T; [Poo] p) a Aut = 84) 2a, 


6.3 Relative Entropy 


A notion directly related to the von Neumann entropy with several useful 
applications in quantum information and of great importance for the topics 
discussed later in the book, is the quantum relative entropy. It is the quantum 
counterpart of the Kullbach-Leibler distance (see (2.94)) and has already been 
introduced in the proof of some properties of the von Neumann entropy (see 
Proposition (5.5.6)). 


Definition 6.3.1 (Relative Entropy). Let S be a quantum system de- 
scribed by a d-dimensional Hilbert space H and p,o € + (A) two density 
matrices acting on H. The relative entropy of p with respect to o is 


Ste = {ne (log p — log c)) if Ker(a) C Ker(p) (6.23) 


+00 otherwise , 


Ker(o) and Ker(p) being the subspaces where o and p vanish. 


Example 6.3.1 (Relative modular operator). [225, 237, 261, 262] 

Given the Hilbert space H = C?, equip the algebra Ma(C) with the 
Hilbert-Schmidt scalar product (5.26) and denote it as << -, - >>. Then, 
consider the following linear operators on Ma(C): 
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Lx[Y]=XY, Rx[Y]=YX. 


They commute with each other, Lx Rz|Y] = XY Z = RzLx[Y]; moreover, 
if X = Xİ they are self-adjoint. Indeed, the ciclycity of the trace operation 
yields 


229 Lx|Z] >> = wyi xZ) = T((xY' Z) =<< Lx[Y], Z >> 
<< Y, Rx|Z] >> = m™(¥t ZX) = Tr((vtxyt Z) =<< Rx[Y], Z >> . 
Further, if X > 0 the operators Ly and Rx turn out to be positive; in fact, 

<< Z, Lx|Z] >> = T(zi xz) >0 
<< Z, Rx|Z] >> = T(z ZX) = Tr(Z x zi) >0. 


Let P; and P; be orthogonal projections; then, Lp,Lp, = ðijLp, and 
Rp, Rp, = dijRp,. As a consequence, from the spectral representation X = 
Xt = M x; | 2; )( x; |, one derives the spectral representations 

d d 


Lx =Y ti Leye Rx =} ti Rim yal > 


i=l i=1 


with orthogonal projections {Lj z,)(2, it as respectively {Rj x; )( x; eae Sup- 
pose p € B}(H) is strictly positive, then Rp- = Rz” is well defined as well 


as the relative modular operator of p and o € By (H), 


Aggie leh = R3 Lo = X. Bee Lsi yts: Biri yirlo (6.24) 


where the spectralizations p = D rjļr;){r;| and o = aR sil si) ( 8; | 


have been used. If both p and ø are strictly positive, the same is true of their 
relative modular operator: for all X € |mdd, 


d 


<< X, ApoX >> =Tr( D> sip? KaKa) 
ij=l 
c 2 
= D sirg” Kril XT |s: )| >0 
ij=l 
Also, 
d 
log Apo = log Lo + log R7‘ = Y (logs: Li 943654 — log r; Biel) : 
i=1 


This yields the following expression for the relative entropy 


<< VP, —(log Apo) [Vp] >>= S (p, 0) . 
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Of the following properties of the relative entropy, joint convexity and 
monotonicity under CPU maps have already been used in the proof of 
the properties (5.162) and (5.163) of the von Neumann entropy in Propo- 
sition 5.5.6. 


Proposition 6.3.1. The relative entropy of p,o € Bi(H) is 


1. positive: S(p; o)>0, S(p; 0c) =0 iffp=a; 
2. jointly convex: given weights A; > 0, i € I, Yijepri = 1, and density 
matrices pi, oi E€ Bi(H), i€, 


S (= Xi Pi 3 > a] < XOCAS (pi, gi) ; (6.25) 


icl icl icl 


3. invariant under unitary maps: let p,o € BY (H) and U : H > H a unitary 
map, then 


S (UpU' , UcU+) = S (p, 0) ; (6.26) 


4. monotonically decreasing under trace-preserving CP maps: let p, o € BI (H) 
and F : BY (H) + B] (H) be a CP map such that Tr(F[p]) = Tr(p), then 


S(Flp], Flo]) < S(p,¢) . (6.27) 


Writing F[p] = p o E, where E : B(H) + B(H) is the dual CPU map of F, 
monotonicity reads 


S(poE,aoE) < S(p,o) . (6.28) 


Also, joint convexity is equivalently expressed by the inequality 


s (xe , Ta) < YOA S (i, %) , (6.29) 


icI icI ier 
where pi := A;p; and Oj := Aici. 


Proof: 

e Positivity: by means of the eigenvalues rj, Sẹ (repeated according to 
their multiplicities) and of the eigenbases |r; ), | s, ) of p, respectively a, one 
computes 


S(p,0) = X` ri(logr; — (r; | logo |r; )) 


= Y ri(los 7 S I(r; | 8&))|? log sr) 
i k 
> X ri(logr; — log(}> spl (r;| sk )1?)) 


k 
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= $ ri(logr; — log((ri| |r:}) 
> Siri —(rilolrs)) =0. 


The first inequality follows from >>, |(ri|s,)|? = 1 and the concavity of 
the logarithm, while the second one is a consequence of the concavity of 
n(x) = —zlog z, 0 < a < 1 (see (2.85) in Section 2.4.3), which also implies 
that equality only holds when the eigenvalues and thus the density matrices 
coincide. 

e Joint convexity: we shall establish (6.29). Notice that, for all w > 0, 


-togw= | ae(— ~ 75) 
oo 1 1 1 
=a- f d (Faar at oa) 


% dt (w-1)? 
=(1-u)+ f (+t? w+t ` 


Then, use the spectral representation of the relative modular operator (6.24) 
to insert Ap, in the place of w, act with — log A,,, on p and take the trace 


of the resulting matrix. Since Tr(a — Ap,o)le]) = Tr(p — o) = 0, it follows 


S(p, 0) = -Tr(1og Anold) = | ptl- aala) 


Further, setting Y = 1 and X = (A,,, +tl)~![p — o] in 


<< Y, (1- 4po)[X] >> = << Y , (Rp — Lo)R7 [X] >> 
ZKS (Rp -= Lo)[Y], Ra [x] >>, 
yields 
© dt 1 
So o= fet ((e-o) aore- e). (6.30) 


Let now p; and õj be as in (6.29) and 
Xj — (Le, Eg tR) "P [5 a oj] = (Le, + tR;,)'/?[B] , 
with B = Bt € M,(C) to be defined later. Then, since the various opera- 
tors are self-adjoint with respect to the Hilbert-Schmidt scalar product, by 
observing that >`; (Le; + tRp,;) = Lo + tRp, one obtains 
0< XL << Xj, Xj >>=) << pj — čj, (La, + tR) [5] >> 
J J 


-<<p-0, B>>-<<B,p-—o0>>+<< RB, (Lo +tR [B] >>. 
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By choosing B = (Lo + tR,)~'[p], one gets the inequality 


5 << pj — õj , (La; +tRz) [Ø — õ;] >>= 
j 


= DORs — 8) be, + ys 31) 


> 1H((p— oN, 4 sat : 


which, once inserted in (6.30), yields the result. 

e Invariance: it follows from the fact that log(UpU') = U(log p)Ut and 
that the same holds for ø. 

e Monotonicity: it is implied by joint convexity. We shall first show that 


S(p1, 01) < S(pi2, 012) , 


where p12,012 € B} 1 (Hi2) with marginal states p1,2 = = Tro, 1p12, respectively 
01,2 = Tro1012. et di = dim(H;), fix an ONB B ALN d2 2. in Hy and define 
the unitary matrices Up E€ Ma, (C), £ = 1,2,...,de, with entries (Ug)jk = 


n 
Sik exp (= je). Then, for all X € Mz,(C), 
2 


d2 d2 
1 TA 
= A UaU P UG- j| X |k) | I) k| 
é=1 j,k=1 


SÑIDGIGIX H): 


p pı = idı ® [p12] and similarly for o1. Furthermore, using the basis 
d2 1 . 
{| 3) }42, to write p12 = D51 Pk Q |J) k], then 


dg dz d 
S(p, 01) =S ye ; pac, å ye (of? , ai} 
j=l j=l j=1 
d2 
=5 E e0 Jz ap @ la) 
j=l 
= a Q [pre], id, ® Bloy9]) 
d2 
LaS S ((h ® Ze)pi2(l1 8 Ze)! , (h ® Ze)oi2(l1 8 Ze)") 
2 21 
= S (p12, 012) , 


where the second equality follows from the orthogonality of the matrices 
contributing to the sums, the last equality follows since the matrices 1, ® Ze 
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are unitary and because of the invariance of the relative entropy, while the 
two inequalities are a consequence of its joint convexity. 
Monotonicity under trace preserving CP maps thus results from Re- 


mark 5.6.3, by writing E[p] = Trg (u (p ® pr) ut): 


S(E[p], E[o]) < $ (U (p 8 pz)U', U (0 @ pp)U') 
=S (pP9pe, 08pe)= S(p, 0). 


Example 6.3.2. As an application of joint convexity, let p € Bı (H) be a 
density matrix describing a statistical mixture {Ajj, pij} p = Jij ÀijPijs 
Xy Aij = 1. Setting Pij := Vij Pigs pt = par Pij and D = X; Pij» one derives 


X AS (Piz P =m Trpi; log Jij — Trp; log p — 2 Aij log Aij 
J 


“rs (bij, A) +S ( pis p -S vhs 


whence (6.29) applied with reference to the sum over the index 7 and the fact 
that >, 2} = p yield 


X rigS (p13 » AYSE, p) +S S (5, p) — So Aig log Aij 
tj j i ij 
=X S (65,7) + 955 (ei, p) 
J i 


+ S> Ablog A; + D ~ 210825 


~ 
P3 

where A = Dag Me Da ob ob 
j j 


The physical interpretation of the relative entropy comes from thermody- 
namics: there, it amounts to free energy. As such, it can only decrease under 
dissipative time-evolutions [286, 189]. Let o in (6.23) be the Gibbs state at in- 
verse temperature 3 = T7} with respect to a Hamiltonian operator H € B(H) 
(see (5.177)): 


o = pg := Zgexp(—GH), 23° =Tr(exp(—GH)) . 


The free energy of a state p € B,(H) is 


F(p) := T S(p) —(H)p, (H)p:= Te(p H) . 
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From F(pg) = —T log Zg it follows that 


Slo; pe) = —S(p) — log Za + B(H)p = B(F (oa) — F(p)) - 


Let S undergo an irreversible evolution described by a quantum dynamical 
semigroup y% : S(S) + S(S), t > 0, with pg an equilibrium state, namely 
[038] = pg. Then, as seen in Chapter 3, the dynamical maps y+ are com- 
pletely positive and fulfill y% = Yt—-s O Ys, t > s. Thus, monotonicity (6.27) 
yields 


8 (velel: vlea) = Srel; 20) = 8( Flea) - Folo) 
= S(t- o ysle]; 4-s[Psl) 
8(rsl6l; pe) = 8(Fles) — FOl) (6.31) 


While the relative entropy behaves monotonically, this is not true of the 
von Neumann entropy (see Example 5.6.3). For instance, if the quantum 
dynamical semigroup mentioned before represents the reduced dynamics of 
a quantum open system S interacting with a reservoir, the free energy of an 
initial state may decrease in time, showing tendency to equilibrium, while 
but its von Neumann entropy may in some cases decrease (for more details 
see [41]). The following example provide a class of dynamical operations on 
the states of S which always increase the entropy of its states (or keep it 
constant). 


Examples 6.3.3. 


1. Bistochastic maps [302, 303] Completely positive unital maps E : 

B(H) ++ B(H) are called bistochastic if their dual maps F : S(S) + S(S) 
1 1 

preserve the tracial state: F A =y 

The most natural bistochastic maps are those associated with projec- 

tive POVMs, Fp[p] = x P;pP,, PP; = bij Pj, eee = 1 (see Sec- 

tion 5.6.1). These maps always increase the von Neumann entropy; in- 

deed, from (6.27) 


s (Fil. F | 5]) = 184 = $a) < $ (0, 5) = 084 ~ S00) 


2. Let P be a projective POVM as in the previous point. The linear span 
of the orthogonal projectors P;, i € I (not necessarily one-dimensional, 
so that card(I) < d) is an Abelian subalgebra Ap C B(H) with identity, 
whose typical elements have the form a = [i q 4 Pi. The space of states 
u over Ap consists of normalized, positive linear expectations 
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Ap rar ula => awh 


wel 


They thus correspond to all possible discrete probability distributions 
with card(I) elements. Given a state p E€ S(S) its restriction to Ap, 
denoted by p|Ap, corresponds to the discrete probability distribution 
Mp := {Tr(p Pj) }ier. It thus follows that p|Ap = Fp[p] = Xer Pip Fi, 
indeed 


T(E a) = 5 a; WPRP) =X a; Tr( p P;) 


i, jEI iel 


From the previous point, it then follows that 


S(p) = min{ S(p |A) : A C B(H) Abelian with identity } : 


the minimum being achieved at any Abelian subalgebra A generated 
by the eigenprojectors P; = |r;)(ri| of p, for in this case p,(P;) = 


Tr(p|ri) (ri |) = ri- 


The following result emphasizes the connections between von Neumann 
entropy and relative entropy [213]; the idea is to exploit the (infinitely many) 
convex decompositions of mixed states. 


Proposition 6.3.2. Let S be a quantum system described by a Hilbert space 
H and let p= J er Àipi; Ài = 9, Dep Ai = 1, be any convex decomposition 
of a mixed state p E€ S(S) in terms of other density matrices pi E€ S(S). 


Then, 
S(p) =min{ > Ai S(pi; p) : p= >> ri} 
icl ie. 
Proof: From (6.23), XC A; S(pi; p) = Slo) — X AiS(pi) < S(p), while 
ier icl 


the spectral eigenprojectors of p give the upper bound. 


6.3.1 Holevo’s Bound and the Entropy of a Subalgebra 


As seen in Example 6.1.2, by encoding classical information into non- 
orthogonal quantum states one may always detect the presence of eaves- 
droppers during transmission. However, the non-orthogonality of the quan- 
tum code-words does not allow for perfect retrieval of the encoded classi- 
cal information, for no measurement can perfectly distinguish between non- 
orthogonal states. 

In fact, let | Y1) and | Y2) = aly) + BLYE ), a £0 be two vector states 
in some Hilbert space H. Suppose there exists a set of orthogonal projections 


6.3 Relative Entropy 295 


Pj on H, 7 € J = J, U J2, such that if 7 € J), respectively 7 € J2, is measured 
then the state is %1, respectively %2, with probability 1. Then, the orthogonal 
projections Q1,2 := ) jej, , Pj, would be such that (%1,2| Q1,2|¥1,2) = 1, 
whereas (1, 2 | Qa, ý ly, 2)= =Q; Therefore, 


= (p| Q2 lve) = I6 (YE IQ lyt) < (BP? <1 = |b| =1 


namely Yı L qe. 

More in general, the symbols į € I4 = {1,2,...,a} of a classical alphabet, 
emitted with probabilities p1, p2,..., Pa, might be encoded by means of mixed 
states p; € Bı (H). Or, from a more realistic viewpoint, given an encoding of 
the classical symbols into pure states %; € H, a noisy transmission channel 
might transform them into mixed states p; := F|| Y; )( y: |], where F is the 
dual of a CPU map E : B(H) — B(H). The receiver must then reconstruct the 
encoded classical message with the least possible error; practically speaking, 
he must seek a POVM B = {B;}ier, C B(H), Ig = (1,2, ...,b}, such that, 
when measured on the statistical mixture p = $ aer, Papa, it maximizes the 
accessible information. 

In such a context, three random variables appear: A, B, and AV B, with 
probability distributions 74, mp and mays: 


1. the outcomes of A correspond to the indices i € I4 of the incoming states 
and TA = {Pa}acla; 

2. the outcomes of B correspond to the indices i € Ip of the POVM and 
mB = {Tr(p Bihiers; 

3. the outcomes of AV B correspond to the joint events consisting of an in- 


coming state pa and a measured index i: TAvB = {pa Tr(a Bi)} . 
a€la,iE€IB 
According to Section 2.4.5, the mutual information I(A, B) measures how 
much knowledge one gains about A, that is about which state p; has reached 
Bob, from measuring on B the POVM B = {Bi hier: 


I(A, B) = H(A) + H(B) — H(A V B) 


=< > Pa log Da = 5 (Tr(p Bi) log(Tr(p B;) 


acla i€lp 
+ XO SS pa(Tr(pa Bi)) log(pa(Tr(pa Bi))) 
a€I,i€lp 
= — J) (Tr(p B;)) log(Tr(p B,)) 
i€lp 


+9 pa J (Tr(pa Bi) log(Tr(pa Bi)) - (6.32) 


acl, i€lIp 


In the classical case, perfect knowledge of A from knowing B can be 
achieved by choosing B such that H(A|B) = 0; in the quantum case, there 
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is a more stringent upper bound on (A, B) that depends on the given de- 
composition p = aer, Papa and is denoted by x (p, {PaPa}acra)- 


Proposition 6.3.3 (Holevo’s Bound). Given p = Duer PaPa © Bi (H) 
and the POVM B = {Bisicr, C BH), 


I(A, B) < x (P, {P; PaPatacta) = S(p -> Dad (Pa) . (6.33) 


a€la 


Proof: [23] Given the POVM B = {B;}ier, C B(H), let B= {b; JjeIs be 
an Abelian algebra with minimal projections b;, bib; = - bib i ee b; = Íp. 
It can be embedded into B(H) (as a linear space) by means of the kear maps 
yp : B =œ B(H) such that yp(b;] = B;. Positive operators in B are of the 
form b = Diez, Gi bi, Bi > 0, therefore yp(b) = jez, BiB; > 0 so that 
YB is a positive map and, because of Example 5.2.6.7, completely positive. 
Also, yg(lg) = eis (bj) = ets B; = 14, whence yp is a CPU map. 
Furthermore, the states poyg and p;°yg on B are diagonal density matrices 
with eigenvalues {Tr(p B; bier, respectively {Tr(a Bi}ier,. Thus, (6.32) and 
the monotonicity of the relative entropy (6.27) yield 


T(A,B) = X` pa S(p°71B Pa YB) < >> Pa S(p, Pa) 


acl, acl, 


Remarks 6.3.1. 


1. Using (5.156) one derives that 


x (P, {PaPa }acr4) = S(p) — >> pa S(pi) < — D> palogpa = H(A) . 


a€l, acl, 


Thus, if (6.33) is a strict inequality, perfect reconstruction of A upon 
knowledge of B is not possible; on the other hand, the inequality is strict 
unless the states pa are orthogonal to each other and thus perfectly dis- 
tinguishable. 

2. A consequence of the Holevo’s bound is that any quantum encoding of 
n bits into n non-orthogonal qubits states | y;} € C?” achieves secure 
transmission, but cannot transfer more than H(A) < n bits of informa- 
tion (when the entropy is expressed in base 2). 

3. Whether the upper bound is achieved or not depends on the ability on 
the part of the receiver to find one or more optimal detection strategies, 
namely those POVM ’s B that maximize I(A, B). As we shall see this is 
a remarkably difficult analytical problem even in low dimension. 
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4. By taking the supremum over all possible POVM ’s B that the receiver 
may devise as detection strategies, one defines the maximal accessible 
information as 


I(A) := sip I(A, B) < x (p, {PaPa tacta) - (6.34) 


Example 6.3.4. Suppose A transmits the bits 0 and 1 to B by encoding 
them into the non-orthogonal states 


+) = pli)  /1—ply2) EC, 0<p<1, (rile) =0, 


chosen with equal probability. The statistics of the encoded quantum signals 
is thus described by 


Ma(C) > p= 5|+)(+1 4 


I-)(-|=pldi)(vi| + (=p) | Y2) (2l, 


and x({p,Ap;}) = H2(p) = —plogap + —(1 — p)logs(1 — p) reaches its 
maximum of 1 bit of transmissible information only when p = 1/2 so that 
(+|-)=1-2p=0. 


The problem of achieving the maximal accessible information I(A) ap- 
peared earlier than in quantum information, in relation to finding optimal 
decompositions achieving the so-called entropy of a subalgebra [213, 88], the 
building block for constructing a particular quantum extension of the KS 
entropy to be discussed later in Chapter 8. 

Let M C B(H) be a finite-dimensional subalgebra with identity, p € 
(H) a state on B(H) and p|M the state on M which results from restrict- 
ing p to act (as an expectation) on the observables in M, only. 


Example 6.3.5. Let A C B(H) be an Abelian subalgebra with k < dim(H) 


minimal projectors @; and p € BI (H) a density matrix; then, p|A amounts 
to the classical probability distribution 74 = {Tr(pai) }4_, 


Definition 6.3.2 (Entropy of a Subalgebra). Let M C B(H) be a subal- 
gebra and p € BY (H) a state; the entropy of M relative to p is 


H, (M):= sup 4 °AS(e1M, pi[M) (6.35) 
p= Dier> iPi icI 
= S(p}M) - DIAS (pM), (6.36) 
P=} ier MPi icI 


where the sup in (6.35) and the inf in (6.36) are taken with respect to all 
possible linear convex decompositions p = Jcr ÀiPpi- 
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It must be stressed that, unlike in Proposition 6.3.2, the decomposition 
comes first and the restriction to the subalgebra only afterwards; this is what 
makes the explicit computation of the entropy of a subalgebra a compli- 
cated variational problem, in general. Luckily, there are particular instances 
of states and subalgebras where things are easier. 


Example 6.3.6. Consider the Abelian subalgebra A of Example 6.3.5 and 
let p € B} (H) be a state which commutes with all elements of A. The fol- 
lowing decomposition of p is optimal, 


k 


p=) _Tr(pai) pi, pi = 
i=1 


VP Gi VP _ pa; 
Trea) Toa) 


Indeed, the restrictions p[A and p; lA are probability distributions 


~ aE a _ f Te(paiai) t 
TAE {Tetea} TAT ca =e j 


j=1 


such that S (p; lA) = 0 and 


i 
H, (A) = S (pA) = — 3 Tr(p ai) log Tr(p ai) . 


The simplest context in which the above argument applies is when, instead 
of B(H), one deals with an Abelian von Neumann algebra. Then, as seen 
in Section 5.3.2, via the Gelfand transform, any finite subalgebra A C A 
has minimal projections which correspond to the characteristic functions of 
suitable measurable subsets of a measure space and thus identify a finite 
partition of the latter or, equivalently a random variable A. Moreover, the 
state w becomes a probability measure js and gives a probability distribution 
over A such that the entropy of A yields the Shannon entropy of A: Hu (A) = 
H(A). 

Instead, the simplest non-commutative application of the previous argu- 
ment is when B(H) = Ma(C) and p = 14/d is the tracial state; then, 


k T A 
Hy (A) = $(ptA) = H(A) =- $> TE tog TC 


We now relate the variational problem in (6.35) to the one in (6.34). The 
clue is that POVMs as B = {B;hier, give rise to decompositions and vice 
versa, while the main technical tools are provided by the GNS construction 
™,)(B(H)) based on the state p. Of particular importance is the possibility of 
dealing with decompositions by means of positive elements in the commutant 
7,(B(H))’ or even in 7,(B(H)) itself (see Remark 5.3.2.3 and the relation 


6.3 Relative Entropy 299 


n (5.146)), an instance of which already appears in the previous example. It 
follows that the probabilities in (6.32) can be recast as 


( 2p | Mo( Bi) Xa |p ) = Tr(p Bi) p(X") (6.37) 
Pa Pa 


(Rp | tp( Bi) Xa 2p) ! 
ge CD | OO 25 6.38 
Tr(p Bi) ’ p ( P a | B ( ) 


where | 2, ) is the GNS cyclic vector and the X} € 7,(B(H))’, a € Iu, are 
positive operators in the commutant such that jaer, Xa = 1. 

It turns out that the linear functionals 7,(B(H))’ > X’ + p(X’) are 
positive and normalized, hence states on the commutant, as well as 


p(X!) = Y (Telo Bi) p(X") = (2p |X" 12) - (6.39) 


1€lp 


Tr(pa Bi) = 


p(X) = 


In analogy with the proof of the Holevo’s bound, let A = {G@;}jer, be an 
Abelian algebra (with identity) generated by minimal projections a; and 
introduce the CPU map y4 : A > 7,(B(H))’, yiyla;] = X; that sends A 
into the commutant 7,(B(H))’. Then, using (6.37) and (6.38), one sees that 
the states p'o 7/, and p; o Y4, i E Ig, on A correspond to the probability 
distributions 7’, = {pataer, and (m4) = {Tr(p)(X")}aer,, whence (6.32) 
can be rewritten as 


I(A, B) = — Y` (Tr(p Bi)) log(Te(p B:)) 


i€lp 
+X SO pa (Tr(a Bi)) log(Tr(pa Bi)) 
a€l, iclp 
=- Ð (X (T(P B:)) ø,(X4)) log pa 
i€Ip a€la 
+ S > (Tr(p Bi) XC pf(X}) log ø; (X1) 
a€la i€lp 
=- YO palogpa + X (Tr(p B:)) XO p(X) log pi(X%) 
i€lp 1€lpB aEla 
= S(p' ov) — X (Tr(p Bi)) S(p; 0 74) - (6.40) 
1€lp 


Therefore, the maximal accessible information relative to the encoding 
{P, PaPa} equals the entropy of the CPU map y4 : A œ 7,(B(H))’ relative 
to the state p' on the commutant: I(A) = Hi (y4). If p is a faithful state, one 
can use (5.146) to substitute the X; with elements of a POVM in B(H) and 
y4 with a CPU map y4 : A+ B(H), so that I(A) = H, (ya). 


The natural embedding 7,4 of a subalgebra M C B(H) into B(H) isa CPU 
map (see Examples 5.2.3.7 and 8) such that p ÌM = pow. This observation 
suggests the following extension of Definition 6.3.2. 
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Definition 6.3.3 (Entropy of CPU maps). Given a completely positive 
unital map y : M +> B(H), where M is a finite-dimensional algebra, its 
entropy relative to a state p € B+ (H) is 


Ho(y):= sup SAS (p07, pion) (6.41) 
P=} ier MPi ET 
=S(poy)— inf NS (pio 7) . 6.42 
(p07) port sei (0107) (6.42) 
Lemma 6.3.1. 


1. Given a CPU map y: M +> B(H) from a finite dimensional algebra M 
into B(H), one has 


0<H,(7) < S(poy) < logdim(M) , (6.43) 


where dim(M) is the dimension of any maximally Abelian subalgebra 
contained in M. 

2. If p is a faithful state, then H, (M) > 0 unless M is the trivial algebra, 
consisting only of multiples of the identity. 

3. Consider two finite dimensional algebras Mı 2 and two CPU maps 4 : 
Mı = Mə, 72: M2 B(H), 


H, (72 0 71) < Hp (72) - (6.44) 


In particular, if N C M C B(H) are two finite dimensional subalgebras, 


H, (N) < H, (M) . (6.45) 


Proof: Positivity and boundedness are evident, monotonicity under CPU 
maps follows from (6.28) applied to Definition 6.3.3, while monotonicity under 
algebraic embeddings follows from considering the CPU maps consisting of 
the natural inclusions ım of M into B(H) and ana of N into M: 


Hp (N) = H, (tm ° inm) < Hp (tm) = Hp (M) . 


As regards the second property, suppose that H, (M) = 0, then, the first 
property of the relative entropy in Proposition 6.3.1 yields p |M = p;|M for 
all decompositions p = }°,<;Aipi. Then, consider the GNS representation 
of B(H) based on p and set p(M) := Tr(pM) = (2,|7)(M) |Q,), for all 
M e M. It follows that 


(R | Xi n (M)|2,) =p(M)(2,|X'|2,) equivalently 
(2| Xi(T (M) - AM) 1) 12) =0, 
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be written as a sum of positive 1 > Xi € 7,(B(H))’; then, since p faithful 
on B(H) implies | 2, ) separating for 7 )(B H)) and thus cyclic for 7,(B(H))’ 
(see Lemma 5.3.1), it follows that (m (M) — p(M))|2,) is orthogonal to a 
dense subset of the GNS Hilbert space H, whence, again from the faithfulness 
of p, M = p(M) 1 for all M € M. 


for all 0 < X; < 1 in the commutant 7,(B(H))’. Notice that any such X’ can 
Pp 


Example 6.3.7. The last result in Example 6.3.6 extends to subalgebras 
M C B(H) which are not Abelian but commute with the state p 7; then 


H, (M) = S (p}A) , (6.46) 


where A is any maximally Abelian subalgebra contained in M. Indeed, from 
Example 6.3.3.2 and the first case discussed in Example 6.3.6 it follows that 
S (pM) = S (p |A) = H, (A), where A C M is maximally Abelian; on the 
other hand, from Lemma 6.3.1, one deduces that 


S (pÌM) = S (pA) = H, (4) < H, (M) < S (p}M) . 


Apart for the simple cases discussed in Examples 6.3.6 and 6.3.7, the 
minimization of the linear convex combination of von Neumann entropies 
in (6.42) is in general an extremely difficult task. At first sight, one might even 
suspect to be forced to consider more than discrete convex decompositions of 
the state p; luckily, the following result ensures that H, (M) can be reached 
within £ > 0, by means of discrete decompositions [88, 222]. We shall denote 


by H? (1, pit) the argument of the supremum in (6.41) evaluated at a given 
decomposition p = J ;ez Aipi, namely 


H? (1, aiher) = >. Aier S (p07, Pie) - (6.47) 


wel 


Proposition 6.3.4. Let y : M ++ B(H) be a CPU map from a finite di- 
mensional algebra M into B(H) and p € B}(H) a density matrix. Given 
a ea P = Vierripi and € > 0, there exists a decomposition 
P= X jez àj Pj where card(J) depends on dim(M) and £, such that 


23 (4.00 Je = e Tn] <e. (6.48) 


Proof: Consider a finite partition Z = {Z;}jez of the state-space S(M) 
of M into subsets Z; such that 


"In such a case, one says that such M are contained in the centralizer of p. 
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01,2€ Z; = lou — 09]| <6 VZ; EZ. 


For instance, card(J) can be chosen not larger than the least number of balls 
of radius 6 that are necessary to cover S(M). Define 


By construction, p = -jej Ap; and 


(Paaka) — HE (1%. 05} e3)| $ [DoS (207) — D4 5 (04) 


tel jEJ 
<>) SO Aisir) - S (e) 
jes ier 
PioyEZj 


By choosing ô appropriately, the result follows from the Fannes inequality 
(see (5.157)). 


From this result, it follows that, for any £ > 0, there exists a decomposition 
P= J icr Ai pi With card(J) depending on dim(M) and £, such that 


H} (Li, pibier) >H,(7) - €. (6.49) 


We shall call e-optimal for y the decompositions which achieve H, (y) within 
£ > 0 and optimal for y those decompositions p = $- Aj; such that 


H, (7) = S (p07) = LS (07) 


6.3.2 Entropy of a Subalgebra and Entanglement of Formation 


In this section, we shall consider some techniques developed in [45, 46, 47] 
that are of help in calculating the entropy of a subalgebra H, (A) where A 
is a maximally Abelian (n-dimensional) subalgebra of a full matrix algebra 
M,(C). The first step is to extract from (6.36) the expression 


E |M, M] := 2 S (pi ÙM) , (6.50) 


P= lici Ai Pi 


where we have specified the state of the system, the total algebra of its 
observables M and the selected subalgebra M C M. Then, one notices 
that the variational problem can be solved by restricting to decomposi- 
tions of p in terms of pure states; this is so for the von Neumann entropy 
is concave (see (5.156)). In fact, assume p = 7, Aj;p; optimal for M (so 
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that E |M, M] = 5 A:S (pi \M)), with non-pure decomposers p;. Then, 
icI 
by further decomposing p; = Jp ÀjkpPjk, One gets another decomposition 
P = Ð; r AjÀjkpjr; thence, S (p; |M) > $. Aj (pjk |M) yields 
k 


E [M, M] < XL A;jAjrS (Pjr |M sds S (p; \M) = E |M, M] . 


j,k 


Notice that, since pure states P; cannot be decomposed, for them it holds 
that 
Ep, |M, M] = S (Pj |M) . (6.51) 


In a similar way, one shows that the functional E,[M,(C),M] is convex 
over the state space Bf (C"): given a convex combination p = >> 45 pj, the 
optimal decompositions p; = >>, Àjkpjk that achieve 


Ep, |M; =D YS (pjr |M 


for each j, provide a decomposition p = >> jk VjAjkPjk Which need not be 
optimal, whence 


Ey, V5 Pj [Mn(C), M ]< Soy Ajk S ( Pjk Ì M) < Xy S (pj M) : (6.52) 
j,k j 


We shall fix M = Mn (C) for some n; the following results turn out to be 
useful [47]. 


Proposition 6.3.5. For a fixed density matrix p € Mn (C) and M C M, (©), 


1. there is an optimal decomposition consisting of no more than n? decom- 
posers; 

2. the functional E |Mn(C), M] is linear on the convex hull of the optimal 
decomposers of p; namely, if p = X; Ai P; is an optimal decomposition 
for Ep[Mn(C),M], where the P; are projections, then any other conver 
combination p = >),v; Pj, with weights vj > 0, $3; vj = 1, is also 
optimal in the sense that, 


Ep{Mn(C),M] = $ v; S(P;[M 


Proof: The first statement results from a theorem of Caratheodory [20] 
since M,,(C) is n? dimensional as a linear space and the set of pure states is 
compact [304] (see Remark 5.3.2.5). 

The second statement is a consequence of (6.51) and (6.52); indeed, as a 
convex functional, E,[M,,(C),.M] can be expressed as 
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E,[Mn(C), M] = sup {A[p] : A affine functional on B[(C")} . 


Let E,[M,(C),M] = Alp], then Alo] < E,[M,,(C), M] for a different state 
g; thus, given an optimal decomposition p = 5°, A; Pj, where A; > 0 and the 
P; are projections, 
E,[Mn 2 Ai S (P, = > AiEp, |Ma (C), M] 
> S aAIP] -AEN P;] An (C), M] . 


Therefore, A[P;] = Ep, [Mn (C), M] for all i; consequently, if p = >), v; P} is 
any convex combination of these optimal projections, then 


Es[Mn(C), M] < X` vS (Pt = oral [M,, ( =a aa 


= Alp] < Ee[M,(C), MI l 


Calculating E,[Mn(C), M] can be simplified if the state p enjoys symme- 
tries that leave the subalgebra M invariant as a set; namely, suppose there 
exists a unitary matrix U : C” + C” such that Ta[e] = U pUt = p and 
I'T|M] = M, where F? : Ma (C) + M,(C) is the dual map of Tu. Then, 


Proposition 6.3.6. Let Ep|Mn(C), M] be achieved at the optimal decompo- 


sition p = X, à; Pi; then, the symmetry map I, gives other optimal decom- 
positions. 


Proof: From p = Talo] = X; à: Ta[P;] and L/P) |M = BTM) = 
P; îM it follows that 


E [Mn (C), M] < X à; S (Dual Pi = 2 S (P, E [Mn (C), M] . 


Particularly suggestive instances of states p € BY (C?) with symme- 
tries are those that are permutation invariant with respect to a given ONB 
{|i)}4_,; they are of the form 


1 x ror 1-2 


d 
1 1 
where | p+) := 7 y |i) so that -7—7 S2 <1. 
i=1 
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By defining F := (w+ | pe \Y+), 0 < F <1, one can rewrite py in a way 
which is directly comparable with the isotropic states (6.3): 


1-F E 
p® = IIE f Tp vel (6.54) 


which we shall denote in the following by E 


by changing d into d?; notice that 


as they are obtained from (6.3) 


a? d 
(flop WS) = (be lop We) =F. (6.55) 
Let m denote the d! permutations i+ (7), 1 < i < d; it turns out that 
1 
d = 
pe =a > Ulol, (6.56) 
T pr 


$ 


where |¢) € C? is any vector such that |(~4|¢)|? = F and U, unitarily 
implements the permutation of the chosen ONB corresponding to 7. 

Let A denote the maximally Abelian subalgebra generated by the projec- 
tions {|i)(i|}4_,; the decomposition (6.56) is such that 


E «[Ma(C), A] < =r 3 P3 | A) = 5 (P3 IA) 


d 
=— DN Gls) Plog l(ols)P =: rF). (6.57) 


Proposition 6.3.7. If p? is a permutation invariant state on Ma(C) and 
r(F) is a convex function of F € [0,1], the decomposition (6.56) achieves 
Eo [Ma (C), A]. 


Proof: Let p? = X Ai Pi, Pi = | 6: )(G;|, achieve E o [Ma(C), A] and 
F 


consider i 
d d 
T a % T 


pe 


i 


The states P” are permutation invariant; according to (6.56) they are com- 


pletely characterized by parameters F; that satisfy 
d u 
= (+ | PP? Ibe) = SoA (dy PE l) = OAL 


Thus, Proposition 6.3.6, the assumed convexity of r(F) and (6.57) yield 
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Ea [Ma(C = S (P¥ 1A) =X Air(F;) > r(F) 


> Bes [Mzu(C), A] . 


Examples 6.3.8. 


1: 


1 _ 
For d = 2, pP = AF 1 ') can be written as 


where a := 24/F(1 — F); then, with n(x) = —xlog x and the notation 
of (6.12), 


e= (=) ! (254) =m (4252) (6.58) 


In order to use the previous proposition, we need show that r(F’) is convex 
on (0, 1]; for this we calculate 


2 
d'r(F)_ 2 (os = 2a) l 
a 


dF? a? 1— 


The function within the parenthesis is monotonically increasing from 0 
to +00; the second derivative is thus non-negative and the function r(F) 
is convex. Then, 


E,@[M2(C), A] = He k Hvn) 


E ae (n) 
p 2 7 ; 


where A is the Abelian subalgebra of diagonal 2 x 2 matrices. Notice that 
this is the only Abelian subalgebra in the d = 2 case: in [45] H, (A) has 
been computed for all states p € M2(C). 


. Given a fixed ONB {|i)}¢_, in C4, consider the doubling map 
w=1 


d d 
MO > X= YP zalial DX] = $ egla. (6-59) 


It is a homomorphism from Ma(C) onto a subalgebra Mg C M,2(C), 
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d 


D[X]D[Y¥]= ŠO xij yreļii)(ji | bk)| et) 
i,j; k,ł=1 


d d 
= (>: Lik ns) Jii) | = D[XY] . (6.60) 


ij=1 \k=1 


It is thus a positive linear map from M,(C) onto Mo C Maz (C) where it 
is invertible: 


d d 
Mo 2 Xo = X` zizli) (jj | = DTX] = XO aiyli)(j| € Ma(C) . 
i,j=l i,j=l 


(6.61) 
Let d = 2 and |0),|1) be the fixed ONB in C?; when applied to the 


permutation invariant state in the previous example, the doubling map 
gives the state 


00 )00 +] 11)( 11 2F-—1 
R® = Dip] = POOL 4 FA (I00)(11 | + 111)(001) 
2 2 
1 
= 7 (lit QF - 1)(01 @ 01 — 02 8 02) + 03 8 08) 
1 0 0 2F-1 
if oe an gd 
7 0 0 0 0 
2F—1 0 0 1 


If thus turns out that RY as defined in (6.13) equals RY so that, for F # 
1/2, RË is entangled with concurrence E(R?) = |1 — 2F| (see (6.14)) 
and entanglement of formation (see (6.15) and Theorem 6.2.1) given by 


Ep[RY] = He k n a) = Ep [Ma(C), A] . 


. In [46], Proposition 6.3.7 has been used to compute E (3) [M; (C), A] where 
F 


1 3F—1 3F—1 
2 2 
Dijar j] aru 
PF — 3 2 2 
3F—1 3F—1 1 
2 2 


and A consists of diagonal matrices in this representation. While for d = 2 

there is only one optimal decomposition achieving E [M2(C), A], when 
F 

d = 3 more optimal decompositions appear. Indeed, one can decompose 


pe? by means of the unitary operator U : C? — C? that implements the 
permutation (1,2,3) + (3,1, 2): 


pO? = Flo dl+ ZUIA U+ ZUIAN, (6.62) 
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where 


a + 2bcos 0 3 
|ġ)} = | a—2bcos(8 — 7/3) | , a:= vV3F , b:= 4/ = (1 — F). 
a — 2bcos(@ + 7/3) 


It turns out that, for 0 < F < 8/9, the function r(F) = S(|¢)(¢| TA) is 
convex, whence 


E @[M:(C), A] =n (= ae = =) 


pan (1 rua) (6.63) 


6 


There exists a value 0 < F* < 8/9 such that E (a) [M; (C), A] is achieved 
F 
at a unique decompositions of the form (6.62) given by 
i VEF +/2F(1—- F) 
|g) = A vF- VF(1-F)/2 |, 
VF- FU- F)/2 
for F* < F < 8/9; while, for 0 < F < F*, two optimal decompositions 
of the form (6.62) appear with 


a + 2bcos 0p 
|p) = — | a—2bcos(tr/3F 0r) | , 
V3 Ng 2bcos(m/3 + 0r] 


where the angle 0p varies with F. According to Proposition 6.3.5, all 
linear convex combinations of the projections onto these vectors also 
provide optimal decompositions. When 8/9 < F < 1, the function 
r(F) = S(|¢)(@|[A) is no longer convex and one cannot use Propo- 
sition 6.3.7; in this case it is the close relation of H, (M) with the en- 
tanglement of formation which is of help. Indeed, (6.63) coincides with 
the entanglement of formation of the d = 3 isotropic states (6.3) for 
1/3 < F < 8/9 as calculated in [298]. 


In order to expose the relation between the entanglement of forma- 


tion (6.5) and E [M, M] in (6.50), set M = Ma(C) := Ma(C) ® Ma(C), 


M 


= Ma(C) embedded as Ma(C) ® Ig into Maæ(C) and p; = |Y; (Yẹ; |- 


Since the marginal density matrices Pas _ pj ÌM, it turns out that 


Ep(p] = Ep[Ma (C), Ma(C)] - (6.64) 


Further insights into the connections between these two notions, with partic- 
ular reference to Examples (6.3).2,3, come from [47] 
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Proposition 6.3.8. Let A C Ma(C) be the maximally Abelian subalgebra 
corresponding to a fixed ONB {|i)}¢_, and D|] is the doubling map (6.59); 
then, 

Ep[Ma(C), A] = Enjo [Maz (C), Ma(C)] . (6.65) 


Proof: Suppose p = )°,;P; achieves E |Ma(C), A]; then, with p = 
Ditu a 
d 
D{p] [Ma(C) = Tr2(D[p]) = >, ruļi){i|=p]A implies 


Epi |Ma: (C), Ma(C 53 Ai S (D[P;] Mu(C = NS (P, 
= E, [Ma(C), A] . 


Vice versa, let D[p] = 97,4; Q; achieve Eppo [Ma2 (C), Ma(C)]; if the optimal 
decomposers Q; were of the form Q; = Spp qlp|kk)( 2|, by the inverse 
doubling map (6.61) one would get a decomposition of p that could be used 
to reverse the previous inequality and thus prove the result. The decomposers 
Q; are indeed of the claimed form as they are one-dimensional projections 
that can always be recast as follows 


vDo 1%) 


A 
Qi = "HDI 


v Diol 
) 


Because of (6.60), it turns out that D[p 


expansion, \/D[p] = D[,/p]. 


Example 6.3.9. In [298], the entanglement of formation of an isotropic state 


pf) was computed by 1) considering the twirling (2) of suitable vectors of 


diagonal form, |®) = ar Jpil ti), with respect to the chosen ONB, and 


” = D[p"] whence, by power series 


d 
by 2) minimizing the von Neumann entropy S (o5 1) =— 5 Li log 4; of the 
i=1 
marginal density matrix. 


2 
Choose one such |Ø) from an optimal decomposition for Ep (pe! J and 
construct the density matrix 


2 1 
RE? = 3D Up @Uz|)(B|Uz" 8 Uz" 


by using the permutation operators U,. Since, by definition, Ur ® Ur are 
2 

symmetries for the isotropic state pt ) using Proposition 6.3.6 one deduces 

that E a2) [Maz (C), Ma(C)] = S (05°). On the other hand, in terms of the 


doubling map (6.59) and using (6.55), 
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g? d = 
Ry) = Dlpp’] |= FL lon (lUz 


where pe ) is as in (6.56) and |¢) = XO vmili). Finally, from Proposi- 
i=1 
tion 6.3.8, it results 


E o [Ma(C), A] = E pea [Me (C), Ma(C)] = S (o9) - 


In this way one can use the results of [298] to extend the computation of 
Epo) [M (C), A] to those values of F € [8/9, 1], where the methods employed 
in Example 6.3.8.3 are useless. 


Trace-distance and Fidelities 


In this section we review some mathematical techniques that are used to 
compare two quantum states of a system S; the importance of such an is- 
sue will become apparent in the next chapter when we shall deal with the 
compression and retrieval of strings of qubits . We shall assume S to be an 
N-level system. 


Definition 6.3.4. Given p12 E€ S(S), their trace-distance is given by 
1 
D(p1, p2) = 5 Tr|p1 — pal - (6.66) 


Namely, the trace distance of two density matrices is defined as half the 
trace-norm ||p1 — p2||zr of their difference: D(p1, p2) is a proper distance on 
the state-space S(S). 


Proposition 6.3.9. The trace distance enjoys the following properties: 
1. Let P € My(C) be any orthogonal projector, then 
D(p1, p2) = max Tr(P(p1 — p2)) - (6.67) 


2. The trace-distance monotonically decreases under completely positive 
trace-preserving maps F : S(S) +> S(S): 


D(F[p1],Fle2]) < D(P1, p2) - (6.68) 


3. The trace-distance is jointly convex: 
DLS Api X Ages) < XO GD (Pies) (6.69) 
J j Jj 


where Aj > 0 and X`; Aj =1. 
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Proof: As seen in Example 5.5.6, p1 — p2 = A— B and |p; — p2| = A+B, 
with A, B positive orthogonal matrices A, B > 0, AB = 0, so that TrA = TrB 
since Trp1,2 = 1. Thus D(p1, p2) = TrA = TrB. Let P be any projector, then 


Tr(P(A — B)) < Tr(PA) < TrA = D(p1, po) , 


for Tr(PB) is a positive quantity and P projects onto a subspace. Further, if 
this subspace supports A, it annihilates B and the maximum is achieved. 

The second property is proved as follows: let P be the projector which 
achieves the trace distance D(F[p1], Flp2]), then, because of the assumed 
trace-preserving character of F, 


D(p,0) = TrA = TrF[A] > Tr(PF[A]) > Tr(PF[A]) — Tr(PF[B]) 
= Tr(PF[A — B]) = Tr(P(F[p1] — F[p2])) = D(F[e], Flee) - 


In order to introduce some useful notions of fidelity, let us begin with 
a simple observation: the closer two vector states %1,2 € H = C to each 
other, the closer to 1 is |( 1 | wW2 )|. Indeed, the latter quantity is 1 iff y = ¢ 
(a part for an overall multiplicative phase) and vanishes when y L ¢. This 
idea extends to density matrices of an N level system as follows. 


Definition 6.3.5 (Fidelity). The fidelity of two density matrices pi. € 
S(S) is 


F(p1, p2) = Try V/pipaV/pi = Try/ (VPV Pr) (VPP) 


= Tr|Ymvpil - (6.70) 
If py = Jay) (dy | =: Pi then VP =P so that 
F(P1, p2) = v (41 | p2 |Y) - (6.71) 


Thus if p2 = | Y2) (Y2 | =: Po, then F(P,, P2) = K Y1 | Y2). 


Proposition 6.3.10. The fidelity enjoys the following properties: 
1. Let |wPV) be a purification of p € S(S) of the form 


wv) 7 Uli) @V|i) , (6.72) 
i=1 
where {|i)}; is an orthonormal basis in H = C and U any par- 


tial isometry such that UU projects onto the orthogonal complement of 
Ker(p) and V is any unitary matriz. Then, 


Flon, pa) = pag Kap Lar) 


, (6.73) 


that is the fidelity is the largest such scalar product achievable by fixing 
one purification and varying the other. 
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2. The fidelity does not depend on the order of its arguments. Further, 
Fp, p2) = 1 if and only if pı = p2, otherwise 0 < F(p1, p2) < 1. 
3. Let E = {Ei }ier denote any POVM with elements in My (C), then 


F (pi, p2) = max{ >> Tr(piF;) (Tr(p2 Ei) : Fi € El $ (6.74) 
icl 


4. The fidelity is jointly concave, namely if p12 = >; roi 2 with0 <A < 1, 
S; Ai = 1 and o}? € S(S), 


F(p1,p2) > XC Ai F(0o}, 0?) . (6.75) 


5. The fidelity monotonically increases under the action of trace-preserving 
completely positive maps F : S(S) > S(S): 


F(Flo),Flos]) = Flor, p2) « (6.76) 


Proof: It is easy to check that Tr,| YYY )(WYY | = p, so that (6.72) is a 
purification of the mixed state p. One computes, 


N 
Kopy wins | = XO (aU Moai lj) (il ViVa |) 
i, j=1 


= Tr(U] Vav aU (VVA) ) < IVVP = F(o1,p2) , 


where T means transposition. Further, the upper bound is achieved by choos- 
ing V2 = Vı and Uz = WÝU: with W such that Jp2/P1 = W\,/p2,/Pil- 
From the previous point, the second point follows at once. 


One expects a relation between trace-distance and fidelity of the kind: the 
smaller the trace-distance, the closer to 1 the fidelity. That this is indeed so 
is the content of the following 
Proposition 6.3.11. Given pı,2 E€ S(S), the following bounds hold 


1— F(p, p2) < D(p1, p2) < V1- F?(p1, p2) - (6.77) 


The following proposition establishes that if pı € S(S) is close to p2 in the 
sense that F (p1, p2) ~ 1 while while F(p2, p3) ~ 0, then also F(p1, p3) = 0. 


Proposition 6.3.12. /19] Let pi.2,3 € S(S), then Fij := F°? (pi, pj) satisfy 


Fis < Fo3 + 2 (1 — Fiz) + 2V (1 — Fiz) Fos , (6.78) 


where Trp3 = 1, but Trpy2 < 1 (subnormalization). 
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Proof: Notice that subnormalization does not alter either the definition of 
fidelity or the first property in Proposition (6.3.10). Let then | Y; } be a fixed 
purification of p1, choose | Y2) in order to achieve Fiz and Fig. Further, 
adjust the phases of the three vectors so that 


Fig = (W |2)? , Fis = (W |93)? , Fas > (Pa| Y3)’. 
Setting |W) := |W) —|%), one estimates 
(|) = (Ui |Ua) + (2/92) — 2 (i |22) < 2(1- VF), 


for subnormalization gives ( W12 | Y1,2) < 1. Then, from (W; |) = Trp3 = 1 
and the bound 


VF = (Wi |2) = (Wo | V3) + (W| Y3) < Fos + |( |% )| 


< Fos bf (|e) < Pea + y2 — VFiz) , 


the result follows. 


Let p € S(S) correspond to a mixture {Aj, pj}, P = D1; Ajpj, subjected 
to the action of a trace-preserving completely positive map F : S(S) > S(S). 
Then would like to keep track of how much F[p] differs form p in the mean: 
this is well described by 


Definition 6.3.6 (Ensemble Fidelity). The ensemble fidelity relative to a 
mixture {A;,p;} and a completely positive action F is defined as the ensemble 
average of square fideltties, 


Foo {Aj pa}sF) = D A F? (oF) (6.79) 


We shall also denote by 
F.(p,F) := sup{ Fav({\j, Pj} F) : PP = Pj) = Pf} (6.80) 


the supremum of the ensemble fidelities over all possible decompositions of p 
as a mixture of pure states. 


Example 6.3.10. [19] Let |i) € H = C3, i = 1,2,3, be an othonormal basis 
and consider the mixture represented by 


p = pi| V1) (v1 | + pol dr )( 1 | + p3l 3 )(%3|, where 
|Y) := cosa|1)+sina|2) , |v.) :=sina|1)+cosa|2) 


and | w3) is such that 


314 6 Quantum Information Theory 


(v3 | v1) = (03 | v2) = (v1 | Y2) = sin2a. 


Suppose the system is subjected to a trace-preserving completely positive 
map such that 


lds Yb = 11, ll ba) (a2 = 12902 
lls es Il = Slvr) dil + 512 (vel (6.81) 


The purification of a state p € S(S) actually couples S to an ancilla and 
this coupling is embodied by an entangled pure state |W, ). The following 
fidelity reflects how much the action of an operation on S described by a 
completely positive trace-preserving map F : S(S) ++ S(S) preserves this 
entanglement. 


Definition 6.3.7 (Entanglement Fidelity). The entanglement fidelity of 
p relative to F is defined by the square fidelity of the states |W,)(W,| and 
F @ id[|Y%,)(Y% |], where |W,) is any purification of p: 


Fent(p,F) = F*(|¥p)(p |,F @ idl )(% I) - (6.82) 
Relations between these various fidelities are as follows [224]. 


Proposition 6.3.13. 
0 < Fent(p,F) < Fav < F(p,Flol) <1, (6.83) 


where Fay is any ensemble fidelity corresponding to a decomposition of p. 


Bibliographical Notes 


The literature on quantum information, communication and computation is 
vast and still growing. There are nowadays various introductions to quantum 
information and related topics: the most extended ones can be found in the 
books [48, 49, 224] and the lecture notes [242]. 

The book [165] offers a detailed introduction to quantum computation, 
while more mathematically oriented presentations are to be found in [145, 
239]. 

As far as entanglement theory is concerned the review [152] provides an 
exhaustive and up-to-date overview of the subject in its manifold aspects 
together with a complete list of references to the relevant literature. 

The book [71] collects a series of reviews from various experts in the 
field where one can find relevant information about many of the topics dis- 
cussed or just touched upon in this chapter; among others, Gaussian states, 
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entanglement measures, bound entanglement, continuous variable systems, 
quantum algorithms, quantum cryptography and one-way quantum compu- 
tation. Furthermore, the second half of the book provides an overview of the 
state-of-the-art concerning experimental and technological implementations. 

A thorough collection of recent results of quantum information and en- 
tanglement theory concerning continuous variable atomic and optical systems 
can be found in [76, 2]. 

As regards the role and use of Gaussian states in quantum information 
theory see also [115]. 

Rapid introductions to the basic tools of quantum information are pro- 
vided in [100, 188] 


7 Quantum Mechanics of Infinite Degrees of 
Freedom 


Quantum systems with infinite degrees of freedom exhibit properties, like 
relaxation to equilibrium, phase-transitions and the existence of inequivalent 
representations of the CAR and CCR, that can satisfactorily be dealt with 
by means of the methods and techniques of algebraic quantum statistical me- 
chanics [108, 64, 65]. The point of departure from standard quantum mechan- 
ics, is that in an infinite dimensional context one is usually provided with the 
algebraic properties of the relevant observables, but, in general, not with an 
a priori given representation on a Hilbert space; the latter rather depends on 
to the physical properties of the systems under consideration [274, 290, 291]. 


Relaxation to Equilibrium 


As discussed in Remark 2.1.3.4, by discretizing chaotic classical systems, 
properties like the exponential growth of errors or a constant entropy pro- 
duction can survive only over times that scale logarithmically with respect 
to the discretization parameter. Indeed, beyond this time-scale, due to the 
finite number of allowed states, quasi-periodicity and recursion appear. While 
in classical dynamical systems, recursion can be eliminated by going to the 
continuum, this is impossible in quantum mechanics because of the intrinsic 
discretization of phase-space, due to h > 0 and to the Heisenberg uncertainty 
relations. However, recursion times can be made longer and longer by letting 
the number of degrees of freedom go to infinity [212, 300]. 

If we let N — co in Example 5.6.1.2, the recurrence time diverges and, 
unlike for finitely many spins, infinite spin chains may exhibit relaxation to 
equilibrium. Indeed, observe that 


(W-D/2 0, (N=1)/2 z 
fn(0,t) := II cos 57 = II COS SEFT 
i=1 i=1 
(N+1)/2 


2t =N +1)/24 
= J| wł- fn (0, 2t) ; 


F 2 cost 
then, when N — oo, fn(0,t) tends to a function f(t) which satisfies 
foo(t) cost = f..(2t) together with f..(0) = 1. By expanding both mem- 
bers of the first equality and comparing equal powers in t, it turns out that 
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int 
foolt) = —; (5.173) thus becomes 


sint 


CoQ j — ,2itBy; 
t = j 
p8 (oå (6) = Bm P 


As a consequence, when t — oo, the time-dependent state p?° defined on the 
infinite spin array by the expectations 


oh, > pP°(o,) = p2*(04,(t)) , 


tends to the state wo. such that woo(o7) = 1/2 and wo.(o4.) = 0 for all j. 


Inequivalent representations 


One of the most relevant aspects of quantum mechanics with infinite de- 
grees of freedom is the existence of inequivalent irreducible representations 
of a same algebra; this fact explains physical phenomena such as symmetry 
breaking and phase-transitions [290, 291, 274, 275]. 

We let N — œo in Example 5.4.1 and set 


| vac); := |0)8% , [a y= [[ o2 1 vac) (7.1) 
j=l 

|vac), := |1)8% , a0) y = [[ of vac), ; (7.2) 
j=l 


where i™ = iiiz: -ip with ij EN. 

The physical interpretation is straightforward: | vac), represent configu- 
rations consisting of infinitely many spins all pointing up, respectively down, 
whereas the vector states | ¿™ )t,, describe local configurations that are ob- 
tained from | vac);,, by flipping the spins at the sites specified by 1, i2,...,%n 
by means of the raising and lowering operators o+. By defining the scalar 
product of infinite tensor products of vectors as infinite products of scalar 
products of vectors at single sites [300], one gets 


Oe p= nm [tae PEt 1M 2 =O: 
é=1 


Indeed, in the first scalar product, outside a local region where the spins may 
be flipped, there are infinitely many spins all pointing up (down). Therefore, 
the value of the scalar product is determined by the spins within the region 
where they are flipped: it is 0 unless the flipped spins on both sides of the 
scalar product match each other. On the contrary, in the second scalar prod- 
uct there are always (infinitely many) spins in | 7“) that are orthogonal to 
the ones at the corresponding sites in | i }. 
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It thus follows that the completions of the linear spans of all vectors of the 
form |”) )t, respectively |i )| give rise to two orthogonal Hilbert spaces 
Hy, respectively H. Furthermore, let A be the (local) algebra generated 
by all products of Pauli matrices as in (5.61); the two Hilbert spaces then 
provide two irreducible, inequivalent representations 7, | (A). In fact, suppose 
X c B(H;) commutes with 7;(A), then 


q 
(4 [|X [G™ Jy = + (vac| IE ir | [ 0% X |vac), 


r=1 s=1 


= 1(vac| X vac} (6 j), 


for Io gr ie o"|vac); = 0 unless raising and lowering operators match 
r=1 
each aier Thus, X acts as +(vac|X |vac); 1 on a dense set of H}, whence 
X = ;(vac|X |vac); 1 and the commutant of 7; (A) is trivial. 
Consider now the magnetization m(N) relative to the first N spins, that 
is the operator-valued vector m(N) of components 


N 
Nea X o, t= 129, 
n=1 


and the average magnetization m = (m1, M2, m3) given by the formal limits 


_ m(N) 
i= l l i 
Choose N > jn > im, 1 < i1 < ji, then (k =f, |) 
jm 
K(i [ma (N) [j Jk = EUN + in — fm) + u Y uA [08 0 Ja 
l=in 
xfs aC NL Yn = ME Jopa l)a 


L=i,, 


for 03/0) =|0), o3|1) = —|1), while (0|o1,2|0) = (1]o1,2|1) = 0. Then, 


. n m k 
„ik Tau diy i Ye = Se a 


N 
lim (a) | 420) 


s(m)\ — 
N—--+00 N |j jk = 


Therefore, the mean magnetization m exists as a weak-limit (see (5.8)), that 
is with respect to the weak-operator topology determined by the representa- 
tions 77, (A) on H}. It thus depends on the representation with respect to 
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which the limit is calculated and belongs to the bicommutant 7;,;(A)” (see 
Definition 5.3.1) where mp = (0,0, u), mı = (0,0, —p). 

If m}, (A) were unitarily equivalent, then there would exist a unitary 
operator U : H} +> Hy such that Utm (A)U = 7(A); if so, it could be 
extended by continuity to the weak closures m+, | (A)”. Then, its action on 
the mean magnetization would give rise to a contradiction: 


-u = m3 = Ul mog U = pUlU =p. 


The vectors | vac);,; behave as vacuum vectors for the spin algebra A; indeed, 
|vac);,; and the representations 7}, | (A) are unitarily equivalent to the GNS 
representations based on the expectation functionals 


Pp q P q 
o (J of [J o%) = rilvacl || of [] o% wach. 
r=1 s=1 


Other interesting representations of A can be obtained by means of the 
following expectation functional 


Ws (i i) = [[ Bo.) ; (7.4) 
é=1 f=1 


where p is the spin density matrix of Example 5.5.5 and o} is the jp Pauli 
matrix at site iz. The corresponding GNS vector | 2, ) can be identified with 
the infinite tensor product of the vector states resulting from purifying p: 


12.) = @va)=@(y/25210) @10) + fF 4)1) @11)) 


Further, the GNS representation 7,,(A) can be identified with the infinite 
tensor product 


m3(A) = €) T (M2(C)) = (Malo) g 1) l 


n 


The von Neumann algebra 7,(A)” is not irreducible, but it is a factor since 
1 
the commutant is 7,(A)! = Qi ® M,(C)). Since Se] 0) = 0, the 


projection A 3 Py := JI = is such that Py| i”) Yo = 0 if the sites 
i=N 
i1,i2,...,in € [N,2N], else Py|i™ Jo = | i™ Jo. 

Given |~); € H} and e > 0, one can find a vector |¢); in the sub- 
set linearly spanned by [a ); indexed by the sites within a suitable finite 
interval I. such that |||); —|¢);|| < £; then, by choosing N such that 
I; N [N,2N] = Ø, one estimates 
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(Pa — Diyil < Px — De) i + 21): —1 eda ll S 2e. 


Therefore, Py — 1 strongly on Hy. Instead, Tr(po3) = s yields 


. _ (1t+s\" fi s=1 
lim | ws(Py) = lim ( 5 ) a G2 yet 


Therefore, for 0 < s < 1, the GNS representation 7,(A) cannot be unitarily 
equivalent to 7(A). If so, there would exist an isometry U : H, — H} such 
that 


0= lim (2,|7s(Pw)|Q.) = lim (2,|U'm}(Py)U|Q,) =1. 


N-+00 N—-+00 


In fact, U| 2; ) € H} and Py converges strongly and thus weakly on H}. 


Factor Types 


According to Example 5.6.2, the states ws on the spin algebra A may be 
interpreted as thermal spin states at inverse temperature 


1, l+s 
— lo . 
Tie 


Bs= 


— the zero temperature state w; is equivalent to the vacuum state wy; 

— wo is an infinite temperature state with the properties of a tracial state 
(compare (5.55)) such that wo( XY) = wo(Y X) for all X,Y € A; 

— for0<s<1,w, is a thermal state with no specific properties. 


Correspondingly, the von Neumann algebras 7” (A) that arise from the strong 
closures of the spin algebras ms (A) on the GNS Hilbert spaces H, are instances 
of the so-called factors of type I, II and ITT [300]. 

The classification of von Neumann algebras starts by considering different 
possible classes of their projections. A projection p = p = p? of a von Neu- 
mann algebra A is called an Abelian projection if pM p is Abelian. Typical 
examples in this class are the minimal projections of Example 5.3.4.1: if p € A 
is a minimal projection and q € A is another projection, then 0 < pqp < p. 
Therefore, pgp = Ap as all spectral projections of pgp are < p and must then 
be equal to p. Since A is generated by its projections q, p Ap is Abelian. 

Two projectors p,q are said to be equivalent, p ~ q, if there exists U € A 
such UU = P and UU? = q. This is the case for the initial and range 
projections in the polar decomposition (see Remark 5.2.2). A projection p € A 
is said to be finite if A > q = q* = q? < p and q ~ p imply => q = p. Any 
Abelian projection p € A is finite; in fact, let q < p and p = U'U, q = UUH. 
Then, since q < p => qp = pq = q (see Example 5.3.4.1), it turns out that 
V := pUp = pqU = qU and VÝ commute so that 


Vİ V = UtqU = pP = p=VVÝ = qUUt q =q. 
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1. A unital von Neumann algebra A is said to be of type J if its identity 1 can 
be decomposed into an orthogonal sum of Abelian projections pp E€ A. 
Typical examples are A := L% (X) and B(H) with H a separable Hilbert 
space. In the first case, the characteristic functions of the atoms of any 
partition of Æ into disjoint measurable atoms are the required Abelian 
projections. In the second case, the projections pn = |n)(n| onto the 
orthonormal vectors {|n }}nen of any ONB are Abelian and sum up to 
the identity. Then, also A @ B(H) is of type I, the required Abelian 
projections being given by 14 8 pn. 

2. A unital von Neumann algebra is said to be finite if its identity is a finite 
projection, semi-finite if its identity can be decomposed into an orthog- 
onal sum of finite projections. Since the projections pn in the previous 
point are minimal, type J von Neumann algebras on infinite dimensional 
Hilbert spaces are semi-finite, finite if dim(H) = n. 

3. A unital von Neumann algebra is said to be of type IJ if it is semi-finite, 
but does not contain any non-zero Abelian projection; of type IIT if it 
does not contain any finite projection. 


The trace for finite dimensional systems (see (5.19)) is a particular real- 
ization of the following general notion. 


Definition 7.0.8 (Traces). /300] A trace on a von Neumann algebra A is 
a map ®: A, ++ R, from its positive elements into the positive reals Ry 
such that 


BALAA) = JOASA) VALE Ry ,, Ai E€ Ay 


@(A)=G(U'TAA) VAEAL,UEA unitary. 
The trace is 


— faithful if B(A) = 0 = = A=0 for AE Ay; 

— finite if B(A) < +00 for all A € A}; 

— semi-finite if for all A € Ay there exists Ay 5 B < A with P(B) < +00; 
- normal if sup (Aq) = (sup Aq) for every increasing net {Aa} C Ax. 


Analogously to what has been proved for states (see Example 5.3.2.3), it 
turns out that if 6 < W for two faithful, semi-finite traces on A, then there 
exists 0 < X’ < 1 in the center Z = AN A’ such that (A) = W(X"A) for 
all A € A, [300]. Then, if A is a factor, Z = {A1} and all traces on it are 
proportional; in fact, given any two traces ®;, i = 1,2, 


Pı < Bi +P —> Pi = di (O1 + Bo), Ba < G+ hy => Pa = Aa(P1 +82), 


whence Pı = Airy 82. 

Consequently, any chosen trace on a factor von Neumann algebra can 
be used to assign its projections an intrinsic dimension, thus providing a 
characterization of types [162, 117, 300]: 
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1. Factors of type I have a semi-finite, faithful, normal trace given by (5.19) 
whose range on projections is discrete and finite for finite type J, factors, 
or countable for infinite type I» factors: an example of the latter case is 
the spin algebra I7,(A)” with respect to the zero temperature state w1; 

2. factors of type IJ have a semi-finite, faithful, normal trace whose range 
on projections is the whole interval [0,1] for finite type IJ; factors or the 
whole of R, for infinite type JI% factors. An instance of the first occur- 
rence is the spin algebra 7,(A)” with respect to the infinite temperature 
state ws when s = 0; 

3. finally, type III factors have no semi-finite, faithful, normal traces: this 
is the case of the spin algebra ms( A)” when 0 < s < 1. 


7.1 Observables, States and Dynamics 


The physical scenario in the examples discussed in the previous section is a 
common one in quantum statistical mechanics. Indeed, the limit of infinitely 
many degrees of freedom is in general achieved in the so-called thermodynam- 
ical limit, where one starts with N particles in a finite volume V C R? (or 
Z? in the case of a lattice system) and lets N,V — oo in such a way that 
N/V + p, where p > 0 is a given spatial density. 

Each V C R? has its own Hilbert space Hy = L4, (V) of Lebesgue square- 
summable functions and the corresponding C* algebra Ay = B(Hy) of 
bounded operators. Instead, in the case of a lattice system, each æ € V carries 
a Hilbert space H, and the C* algebra Az := B(Hz), so that Hy = pey He 
and Ay = pey Aæ- Notice that in the continuous case each Hilbert space 
Hy is infinite dimensional, while in the discrete case it depends on whether 
the Hilbert spaces H, at the lattice sites are finite dimensional or not. If 
Vi C V2, set Vig := Vz \Vi, then Hy, = Hy, ® Hye and Ay, becomes a 
subalgebra of Ay, by embedding any A; € Ay, into Ay, as A; Q Ive, where 
Ive, denotes the identity operator on the Hilbert space Hy... It follows that 
the set Ap := Uy Av is a «algebra, namely it is closed under addition and 
multiplication of its elements; also, it is naturally endowed with the norm 
Ay > X > ||X|| for all V c R8. 


Definition 7.1.1 (Quasi-Local C* algebras). The normed x-algebra Ao 

l-l 

is the algebra of local observables, while its norm-closure A := J Av is 
V 


known as a quasi-local C* algebra. 


Remark 7.1.1. The notion of quasi-local algebra is physically motivated 
by the fact that the only experimentally accessible observables of infinitely 
extended quantum systems are the local ones. These can then be used to 
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approximate as much as one desires the non-local ones. From a mathematical 
point of view, the construction is an instance of inductive limit [117] of a 
directed net of C* algebras. In the case of an increasing sequence {An, }n,en 
of finite-dimensional C* algebras An, C An,,, the generated quasi-local C* 
algebra is called Almost Finite (AF), if the algebras An, are full matrix 
algebras M,,,(C) then A is known as Uniformly Hyperfinite (UHF) [277, 244, 
245]. 

Suppose an increasing sequence of finite-dimensional unital C* alge- 
bras {An,;}n; is represented on a Hilbert space H, then the strong closure 
of Un, An; is a von Neumann algebra M which is called Hyperfinite. An 
Abelian instance of such an algebra is the von Neumann algebra of essen- 
tially bounded functions, L% (æ), which one can generate by means of the 
characteristic functions of finer and finer finite partitions of VY as explained 
in Remark 2.2.3.4. 


Example 7.1.1. [10] Let M,,,(C) C M,,(C) be two matrix algebras; given 
a system of matrix units E a for the smaller one (see (5.12)), the or- 
thogonal projections EY) sum up to the identity >)" ED = = lz € M,,(C), 


whence )>;2, Tra(EQ)) = = n2, where Trz denotes the trace computed with 
respect to the Hilbert space C”2. But then, using the cyclicity of the trace, 


Tro(Et?) = Tre(E? BD) = Tra( ES BO) = Tre(E) , 


for all k,p = 1,2,...,n1, whence ng = nı x d, where d := Tro (E0) for all 
k =1,2,...,n,. Let {| fi) € C”2}2_] be an ONB in the subspace projected 
out by EY and set 


2 1 1 
Bete Teale len (7.5) 


Since 1 < ky, jı < nı while 1 < ko, jo < d, these are n? x d? = n3 matrices 
in Mn, (C); moreover, from (5.12) and EY | fo) =| fp) it follows that 


(2) (2) — p) (1) p(1) (1) 
ECs Jkea)s(i sda) E piana) = Ei | fro ) fia | Eiji Es 11 | Sp. ) fa | Eig 
= ĝjıpı Eha lfa) fin Eat faele 


1 
= ôjı pı Ôj2p2 E Ne ‘| fro M faz (EQ 


(2) 
= Ö;jipi Ojo. Bie. koitang) ` 


Thus, (7.5) defines a set of matrix units in Mn, (C). Set Bf j, := | fra )( fis |; 


then, P ka)s(disj2) CAN be isomorphically represented by a ® Eğ j 


C2 = C™ Q C4, Therefore Mn, (C) is isomorphic to Mn, (C) & Ma(C). The 
matrix algebras M,,,(C) C Mn, (C) that generate a UHF algebra A must 


on 


i+1 
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be such that any n; must divide the subsequent one so that A is isomorphic 
to an infinite tensor product of matrix algebras. The simplest instance of 
UHF algebra A is a quantum spin chain (see Section 7.1.5). In the case of 
the previously discussed infinite spin system, A is the quasi-local algebra A 
generated by the local algebras Aj_;.4) = Qi p(M2(C))e which are tensor 
products of 2 x 2 matrix algebras at each lattice site. 


7.1.1 Bosons and Fermions 


Physical systems of quantum statistical mechanics usually consist of indistin- 
guishable particles and are described by operators of creation and annihila- 
tion satisfying either the CAR (5.62) or the CCR (5.92). More precisely, one 
considers the Fock representation built upon the existence of a distinguished 
vacuum vector | vac) (which was considered in Examples 5.6.2.1,2 for finitely 
many degrees of freedom). 

Let H be the Hilbert space describing a single Fermion or Boson and let 
{| wi) }ien be an ONB. Then, one introduces operators 


ai := a(i) such that a(yi)| vac) =0 VieN 
al := al (yi) such that a!|vac) =| yi). 


l 


They are required to satisfy the CAR (5.62) if the particles are Fermions, 
the CCR (5.92) if Bosons. 

By expanding any |Y) € H along the chosen ONB , |y) = 90, cil Yi), 
one can consistently define creation and annihilation operators of generic 


|Y) €H: 
al (y) =J cal , a(l) =) ġa. 


This yields a(1)| vac) = 0, at (Ņ)| vac) = |W) and 
[a(v), ato] = [to ato] = 0, fa), atto] = (414) (CoR) 
{a(s), ald) } = {at(d), ato) }=0 , Saly), at(4)} = (wld) (CAR), 
for all |Y), |¢) € H. Furthermore, by using these algebraic relations one gets 


a(th)|b) = a(y)atġ| vac) = (Y |ġ)| vac) . (7.6) 
For both Fermions and Bosons the number operator is defined by 
N= 5 al Qi . 
iEN 


Directly for Bosons and by means of 
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[alai , a (f)] = aj{ai, al (f)} — {a} , al(f)}ai = fia} , (7.7) 
where fi := (Yi | f ), for Fermions, one finds that 
IN, al(fl=al(f),  [N,a(f)]=—a(f). (7.8) 


The Fock space Hg for Fermions, respectively Bosons is generated by the 
completion of the linear span of vectors of the form P(a( f), at (g))| vac) where 
P(a(f),a'(g)) is any polynomial in Fermi, respectively Bose annihilation and 
creation operators. The Fermi operators a*(f) are bounded on the Fock 
space; indeed, from the CAR it follows that, for any normalized |W) € Hp, 


lla" A) I? + al (AY DIP = IFIP - 


The polynomials P(a(f),a'(g) with f,g € Hy, where V is a finite volume, 
generate, by norm completion, a local C*-algebra, AS. 


Remark 7.1.2. Given two volumes Vı C V2 C R, the local Fermi algebra 
cannot be isomorphic to Af, Æ Af. Q Af,» where V3 := V2 \ V2. In fact, if 
| f) € Hy, and |g) € Hyg, despite the fact that ( f |g) = 0, commutators of 
the form [a*(f) , a#(g)| need not vanish. However, because of (5.63), commu- 
tators vanish if one considers polynomial with even numbers of creation and 
annihilation operators: the quasi-local C* algebra they generate is denoted by 
AC. The quasi-local algebra O* generated by polynomial with a same number 
of creation and annihilation operators is denoted by AË; it is known as even 
Fermi algebra and commutes with the number operator. Indeed, using (7.8), 
it turns out that the number operator generates the gauge-transformation 


cio atep eeN = 3 EM tw, te LW, aI 
+ er a re 


k! 
= k times 
+00 7. nk 
~ = ames ae at] ---J] 
m i k—1 times 
— = (ia) OIE E 
=A S setal) (T9) 
j k=0 d 
of at (f) eN =e a(f) = alel*f) . (7.10) 


Therefore, the various phases compensate each other in polynomials with 
equal numbers of a and at; these are thus left invariant by the gauge- 
transformation for any a € R and must therefore commute with the number 
operator N. 

For Bosons, |a” (f1), a*(fo)] = 0 if f; € Hy, and Vi N Vz = 0; however, 
the operators a*(f) cannot be bounded (see (5.69)) In order to construct 
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local C* algebras generating a Bose quasi-local algebra AË, one associates 
to |v) € H the bounded operators [108] 


a) + a 


W(w) := exp (i 5 (7.11) 


which generalize the Weyl operators (5.96) and linearly generate a local 
Bosonic C* subalgebra AB by choosing % supported within the volume V. 


The C* algebras Ag, p are irreducibly represented on the Fock spaces 
Hpg,r. In order to show this one can use a similar argument as for the spin 
algebras 77, (A) discussed in the previous section. If X belongs to the com- 
mutant, X € A’, p, then (7.6) yields 


(vac|algn) a(g) X at( fi) -a (fm) vac) = 
= (vac| X a(gn)+-a(gi)at (ft) +a (fm) [vac ) 
= (vac| X vac) (vac| a(gn) ++ a(gı)at (f1) a? (fin) lvac) , 


where (see (5.185) and (5.183)) 


(vc aan) alah) al Fup leae) = CBee a) CAR 


(7.12) 
Therefore, X = (vac| X |vac) 1 whence the commutant is trivial; this means 
(see Lemma 5.3.2) that Agr are irreducibly represented. 


Quasi-free Automorphisms and Quasi-free States 


The gauge-transformation (7.9) is a particularly simple example of quasi-free 
automorphism. 


Definition 7.1.2. Every single particle unitary transformation U : H — H 
gives rise to a quasi-free automorphism on AP? given by 


Oula*(f)] =a" (Uf) . (7.13) 


Quasi-free automorphisms are typical time-evolutions of non-interacting 
particles possibly subjected to external potentials. They preserve the num- 
ber operator; indeed, by expanding U| fi) = >> j cij| fj) with respect to the 
chosen ONB, it turns out that 


O(N] = So al(U fia (Ufi) = ac cipal ak = Soa) “(bjs 


iEN i,j,k jEN 


for the matrix C = [c;;] is unitary. 
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Examples 7.1.2. 


1. 


Let {Ux}eers be the unitary group of space-translations, 


|f) => Ual f) =| fe);  (r|fæ) =fr- x), 


for all f € L3, (R?); then, {Oz}ecs is the group of space-translation 
automorphisms of AP:®: 


O2[a*(f)] = a* (fe) - (7.14) 


. Let h : H + H be a single-particle Hamiltonian with discrete spec- 


trum, h = Jo, cilyi) yi] being its spectral decomposition, and set 
af := a#(q;). Therefore, the basic annihilation and creation operators 
annihilate and create single-particle energy eigenvectors. Consider the 
second-quantized Hamiltonian H = €; alai and the generated one- 
parameter group of automorphisms O := {0; her, 


a# (f) = Oila” (f)] := e” a# (f) e7“. 


By expanding and summing as in Remark 7.1.2 one finds that, for both 
Bosons and Fermions, 


et aff) et" =ale ts), eH at(f) eH sales), (7.15) 
for all a € C. Therefore, 
Oila” (f)] are" f) , (7.16) 


whence the group O is a quasi-free time-evolution. 


. Let us consider a single-particle Hamiltonian h with an absolutely con- 


tinuous spectrum and the corresponding quasi-free time-evolution (7.16). 
For instance, the free-time evolution given by (p|h|f) = p?/2m)f(p) 
in momentum representation. 

In this cases, one can use the so-called Riemann-Lebesgue Lemma [258]. 
For an integrable function f : R — C with integrable first derivative, it 
follows from integration by parts: 

ei” t 
E 


fw te) cirt feUe 
= a fo)” —0 


when t — +oo. Because of the assumed absolute continuity of the spec- 
trum of the single-particle Hamiltonian h, this lemma ensures that 


lim I[a(f), a(g = lim |( fle g)| =0 (7.17) 


t— 00 =E 
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for all f,g € H for Bosons and 


lim I{a(f), a(g} = lim fleg) = 0 (7.18) 


t— +00 


for Fermions. While Bosonic annihilation and creation operators commute 
asymptotically in time, Fermionic ones anticommute. However, by means 
of (7.7), one computes 


[a(f)a( fo), al (e'™g)] = alfa) (Faleg) — (Faleg) af fa) . 


Then, ||[a(f1)a( f2), at(e’"*g)]|| — 0 when t — o0; furthermore, the 
same asymptotic commutativity in time holds for [X , O:[Y]] where X 
is an even polynomial in a,a and Y any polynomial. By continuity, it 
extends to all X belonging to the even Fermi algebra A© and all Y € AF. 
Moreover, this result holds for all quasi-free automorphisms 0,(a*(f) = 
a(U;f) over AC consisting of a discrete or continuous group {U;}+er.z of 
single-particle unitaries U, : H +> H such that lim;+..( f | Utg) = 0 for 
all f,g € H. This phenomenon is known as asymptotic Abelianess. 


Definition 7.1.3. /65, 108] A quasi-free state on AP is any linear functional 
wa such that w(1) = 1 and 


walW(b)) = exp (=F (| (1+ 24) p) , (7.19) 


where 0 < A € B(H) is a positive bounded operator on the single particle 
Hilbert space H = L2,, (R3). 

A quasi-free state on AF is any linear functional wa such that wa(l) = 1 
and 


walat (fm) ---al(fi)a(gr)-+-a(gn)) = nm Det[( gil Alfi)], (7-20) 
where 0 < A < 1 € B(H) is a single particle operator on H = L3, (R3). 


The Fock vacuum satisfying (7.12) is the simplest instance of a quasi-free 
state; the one with A = 0. Like classical Gaussian states, quasi-free states 
also can be reconstructed from their two-point correlation functions 


wa(al(f)a(g)) = (g| AIF) VhgeH. (7.21) 


This property results directly from the determinant in (7.20) for Fermions, 
while for Bosons it can be proved by showing that (7.19) leads to 


wala (fm)---al(fi)a(gr)-+-@(9n)) = Snm Per[( g: | AIF; )] , (7.22) 


where the so-called permanent is as in (5.185); indeed, (7.19) is a generaliza- 
tion of (5.186) to the infinite dimensional case. 
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Example 7.1.3. [65] Suppose a quasi-free state satisfies the KMS condi- 
tions (5.179) with respect to the quasi-free time-evolution (7.16); using the 
commutation relations and (7.15), it turns out that 


walat(fya(g)) = (gl Alf) = walalo) Aigla (N) = walalg)at(e-P*)) 


= (gle |f) twalal(e** f)a(g)) 
= (glePh & Ae Ph If) 


Since this is true for all f,g € H it turns out that (compare (5.182) 
—Bh 
and (5.184)) Ax = oe where the plus sign holds for Fermions and 
Fe 
the minus sign for Bosons. 


KMS States and Modular Theory 


We have seen that Gibbs states of finite dimensional quantum systems sat- 
isfy the KMS relations (5.179). These relations can be extended to infinitely 
many degrees of freedom where they identify equilibrium states at a given 
temperature [131] (a simple instance of this fact was offered in the previous 
example). Unlike with finitely many degrees of freedom (see Remark 5.6.1.1), 
there can be more than one equilibrium state at inverse temperature 8. An 
equilibrium state is called extremal when it cannot be decomposed into a lin- 
ear convex combination of other equilibrium states at the same temperature; 
extremal equilibrium states give rise to factor representations and can be in 
some cases rightly identified as pure thermodynamical phases [300, 65]. 

Given a triplet (A,O,w), with a faithful state w. The latter is said to be 
a KMS state at inverse temperature B with respect to the automorphism O 
if the functions (compare (5.178)) 


Fxy(t):=w(@[X]Y), Gxy(t) :=w(YO[X)  YX,Y €A, 


can be extended to analytic functions F'xy (z), respectively Gxy(z), on the 
strips —8 < S(z) < 0, respectively 0 < S(z) < 8, and continuous on their 
borders, where they satisfy 


w(O[X]Y) = w(¥ Or4i[X]) . (7.23) 


We outline a few of the many properties of KMS states [300] (for a more 
detailed analysis see [65, 108]). These properties involve the GNS cyclic rep- 
resentation 7,,(A) on the GNS Hilbert space Hu, and the GNS implementation 
of O by a unitary operator U,,. 
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Remarks 7.1.3. 


1. When extended to the von Neumann algebra 7,,(A)”, a KMS state w 
remain KMS in the sense that 


(2, |X Uw (bY |Q,) = (Ru |Y US(t+1B)X |Q,) VX,Y E TA)", 
2. KMS states are O-invariant: indeed, (7.23) implies 
x(t) := w(Or[X]) = w(Or4i8[X]) = fx (t + ip) 
for all X € A. Thus, fx(t) can be periodically extended over the whole 
of C where it defines a bounded analytic function for f(t) is bounded 
on the strip —8 < S(z) < 0. Therefore, this function must be constant: 
fx(t) = fx (0), whence O;[X] = X for all X € A. 
3. The center Zu = 7(A)/N7,(A)’ of the GNS representation based on a 


KMS state w consists of 9-invariant global observables. Indeed, if T € Zo, 
by the same argument as in the previous point, the function 


fx T(t) = (Ro | ToX] TQ.) = (Qu |T Oriel X] |. ) 
= (2. | to (Oriel X] T|Q.) = fx,r(t + ip) 
can be extended to a bounded analytic function over C, for all X € A. 
Then, it must be fx,r(t) = fx,r(0). Since t € Zw, choosing X = Y+ Z, 
Y,Z € A, yields 
fyi z,r(t) = ( Nw | Tu lY) U s(t) T UL (t) Tw [Z] [Ru ) 
= fyt z,7(0) = (Ro | To (Y)! Tail 2) [2u ) 

on a dense set, whence U,,(t) T U(t) =T for allt € R. 

4. For fixed inverse temperature, the KMS states form a convex set which 
is compact in the w*-topology [300]. 

5. A KMS state is said to be extremal KMS if it cannot be written as a con- 
vex combination of other KMS states (at the same inverse temperature). 
The GNS representation based on an extremal KMS state is a factor: 
Zw = {Al}. If not, there would exist 0 < Ti2 E€ Z, with Ti +7, = 1 
which could be used to construct the states 

(Ro | Tima (X) |Qu) 

( Rw | Ti KA ) 

which turns out to be a KMS state with respect to O at inverse temper- 

ature 3. Indeed, 


( Ro | Tita (UX nY) [Qa ) 
(Ro |Ti|2u) 

(Ro |T (UX Tim, (Y) | Qu ) 
(Ro |T;|2.) 

(2. | Tita (VY) mu (O14 [X]) |20) 


E (Ru [Ti Ra) = wi(Y Oriel X) - 


wi(O[X]Y) = 
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The modular theory or Tomita-Takesaki theory, which has been intro- 
duced in its simplified finite-dimensional version in Section 5.5.1, extend to 
generic von Neumann algebras M C B(H) with a faithful state (see Defini- 
tion 5.3.2) [64]. More precisely, given a quantum triplet (A, O, w), if the GNS 
state is such that 


KX) 2s) =0 = X=0 VX ETA", 
then there exists a modular conjugation J : Ho œ Ha, such that 
Pel, Juy = O); dental Ja = nal A (7.24) 
and a modular operator A, : Ho +> Ha such that 
Jun Ay X| Ro) = Xİ 2) VX ETA). (7.25) 
Furthermore, the maps 
at i Tul A)" 3 X = At X A" (7.26) 


form a group {0t her of automorphisms, called modular group; moreover, 
they satisfy the KMS conditions 


(Ro | XY Ro) = (Ro |Y Z (X) |u) YX, Y E n(A, (7.27) 


that we will shortly write as w(XY) = w(Y o}Ż(X)). 


Example 7.1.4. If M C B(H) is an Abelian von Neumann algebra (M C 
M’) with a cyclic vector |}, then it is maximally Abelian. In fact, | 2) 
is necessarily cyclic also for the commutant M’, thence separating for the 
bicommutant M” = M (see Lemma 5.3.1). Then, (7.25) gives 


|| 7VAX].2)||? = [X+ 2)? = (2| XX+ 2) 
= (2|X'X|2) = |X| 2)? , 


for all X € M since M is Abelian. Therefore, A = 1 and JXJ = X1, 
whence (7.24) yields M = M’. 


Example 7.1.5. A most used GNS representation [300], is the so-called ther- 
mal representation whose cyclic and separating vector is the tensor product 
of two vacuum states, | 2g) = | vac) ®|vac), so that the GNS Hilbert space 
is isomorphic to the tensor product of two Fock Hilbert spaces. 

We shall consider the framework of Examples 5.6.2.2,3 without restric- 
tions on the dimensionality of the single particle Hilbert space and on the 
cardinality of the spectrum of the single particle Hamiltonian h. We shall 
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denote by a*, respectively b#%, Bose, respectively Fermi, creation and anni- 
hilation operators. In the Bose case, their action on | 2g ) is 


ma(a(f)) =: apl f) = a(./1+ A_f) @1+ l@aljV/A_f) 
ma(a'(f)) =: ab(f) = al(\/1+ A_f)@1+ lea VA f), 


where the single particle operator j : H + H is antilinear and satisfies 
(jf lig) =(g\|f), while in the Fermi case 


ma(0(f)) =: bal f) = (V I- Ay f) @1+ O@al(jVA;f) 
mg(b'(f)) =: bb (f) =b'(,/1— A, f) @1 + O@a(jV/AZf) , 


where Ø is an operator on the Fock space such that 0b# = —b*@ and 
0| vac) = | vac}. Then, 


abl fl Qe) =|vVI+A-f)8|vac), ag(f)| Na) =| vac) @ | jVA-f) 
bI 2s) =| vI- Af) @| vac), ba (f)| Ng) =| vac) @ | VAS) 


whence 


(2 | ah (faa(g) |2e) = (JV Af |i V4-9) = (g| A-If) 
(2g | bb (f)ba(9) |2a) = (iV Arf iV A9) = (gl Alf) - 
The modular operators read 


A F t F f 
"TENNE 6; a! ay + „Eini Qi i E i Ei bibi aR i Ei bibi 
z ae PLisala g etPEisala | At = eP E ea l 


where af and b” create or annihilate eigenstates |¢;) of the single-particle 
Hamiltonian h. By means of calculations similar to those that led to (7.9) 
and (7.10), one explicitly calculates 


A; 4a(f)| 2s) = | vac) |j” yA) 
Aj ba(f)| 2a) = | vac) 8 |j” Af) 


One can thus explicitly evaluate the action of the modular conjugation; 
from (7.25), 


Je aa(f)| 23) = Apart?) f)| 2g) = |e "/2,/1 + A_f) @| vac) 
=| /A-f) @| vac) 

Js ahl f) 2a) ) = /Agag(f f)| 2g) = | vac) @ | je?"/?\/A_ Ff ) 
=|vac) @ |j VIF 4-f) 


in the case of Bosons, while for Fermions one obtains 
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Ja ba(f)| Qa) = AFORE) Qs) = le-8"/2,/T— Arf) 8 | vac) 
=| VÆ f) 8 | vac) 

Js bh(f)| Qs) = y/Azdg(f)| Qs) = vac) @ jef yA f) 
=|vac) | jy I-A; f). 


Since thermal states are faithful, | 2g } is cyclic and separating, therefore 


Jp ag(f) Ja = al(./A_f) @14+1@a(j/1+4 Af) 
Jaa} (f) Ja = a(/A_f) @14+ 1@at(j/1+4A_f) 


for Bosons and, for Fermions, 


Ip bal f) Ja = (VAF) @14+ 6 @ Wj /1— Ay f) 
Ja bh(f) Ja = W(/Asf) @1+0@d'(5/1- Af). 
The thermal representation has been used in [211] to implement the trans- 
position in an infinite dimensional context and study the entanglement prop- 
erties of infinitely extended quantum systems (see also [308]), the starting 
point being (5.149) in the finite-dimensional case. Let V the flip operator 
which exchange vectors in tensor products V| 8 Y) =|w@@), then 


Vi J, X? @lyd,V =X" 81. 


Analogously, in the thermal representation one may represent the transposi- 
tion as follows 


Ye laa(f)] = Vi Jp ab(f) Ja V- = ak (G./14+ A_f) @ 1410 al(/A_f) 
i lba(f)] = Vi Ja bh (f) Ja Ve = b (jy 0 Af) ©1408 b(A f) . 


Among the CPU maps on a C* algebra A, a special role is played by 
the conditional expectations (see Definition 5.2.3). Suppose the orthogonal 
projections P; in Example 5.2.9.1 commute with a given density matrix p € 
B} (H), it then follows that 


po E(X) = Tr(pE[X]) = XL T(P; p P: X) = TÒ Pip X) = p(X) 
ic] ie 

for all X € B(H) for J;e; Pi = 1. One says that the conditional expecta- 
tion from B(H) onto the Abelian subalgebra P C B(H) generated by the P; 
respects the state p. Also, notice that if p is faithful then P is left invariant 
by the modular automorphism (5.180), that is of[P] = P. This is the key 
point how to extends these considerations to the case of general von Neu- 
mann algebras with faithful normal states, where conditional expectations 
are identified with normal projections of norm one (see Remark (5.2.7)). We 
state the result, for a proof see [293]. 
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Proposition 7.1.1. Let A be a von Neumann algebra, w a faithful normal 
state with associated modular group of automorphisms ot, , t € R. Moreover, 
let Ao C A be a von Neumann subalgebra and wo the restriction w Ao with 
associated modular automorphisms of,,. Then, the following conditions are 
equivalent: 


1. there exists a normal conditional expectation E : A +> Ag that respects 
the state, w o E = w; 

2. ot [Ao] C Ao for all t € R; 

3. ot [A] = of [A] for all A € Ao. 


7.1.2 GNS Representation and Dynamics 


Quantum dynamical systems will be identified as non-commutative algebraic 
triplets (compare the analogous commutative Definition 2.2.4). 


Definition 7.1.4. Quantum dynamical systems are triplets (A,O,w), where 
A is a C* algebra with identity 1, the dynamics O corresponds to a group of 
automorphisms O, : A= A, t E€ G, such that 


0,00, =O0,0O; = On45 , wo =w, Ys tEG, 


where G = Z or G = R and the state w : A =œ C is a normalized, positive, 
O-invariant expectation, namely w o O, =w for allt E€ G. 


Given an algebraic triplet (A, O©,w), a natural Hilbert space formulation 
is based on the GNS construction (see Definition 5.3.7); it does provide not 
only a representation 7.,(A) on a Hilbert space Ho with a cyclic invariant 
vector | 2), but also an implementation of the dynamics by a group of unitary 
operators. 


Proposition 7.1.2. /107, 300] Let (A,O,w) be a C* dynamical system and 
(Ho, nw, Ru) the associated GNS triplet, then, the C* automorphism © is 
implemented by a unique unitary operator U., : Ho > Hu, 


m(O(X)) =Uln(xX)U, VXEA. (7.28) 


Proof: Given the GNS representation Tw, 7 := Twe 0 O is another repre- 
sentation of A on Ho, such that 


(Ro | m(X) Ru) = (Ro | mo(O(X)) [Ru ) = w(O(X)) = (X) . 
Therefore, Remark 5.3.2.1 ensures the existence of a unitary operator U,, such 
that (7.28) holds. If another unitary operator W with the same properties 
exists, then [wtu Tol X)| TY )| Rs) = 0 for all Y € A, namely on a 


dense set; therefore, W'U,, belongs to 7,,(A)/ for which | 2,,) is separating 
(see Lemma 5.3.1). Then, W = Uw, since (W'U,, — 1)| 2,,) =0. 
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Remarks 7.1.4. 


1. 


If the dynamics is specified by a one-parameter group of C* automor- 
phisms {0;}rer which is weakly-continuous in the GNS representation, 
then the group Us (G) := {U.(t)}ieg, G = R, is strongly continuous on 
Ha. In fact, the previous Proposition asserts that each O; is implemented 
by a unitary operator U,,(t); furthermore, 


Ui (t)UL(s)m.(X)| Qo) = UL) m(@s(X))| Qu) = To (Or46(X))| Qo) 
= UN (t+ s)m(X)| Qo) 


on a dense set, whence the family {U.,(t)}:er forms a one-parameter 
group of unitaries on Ha. Strong continuity follows from weak continuity 
and 


(To) = Drol X)| 2)? = 2(w( XTX) - RW(XtE,(X)))) . 


. The Fock representation is unitarily equivalent to the GNS representation 


based on the vacuum state: w(a(f)) = (vac|a(f) |vac) = 0 for all | f) 
in the single-particle Hilbert space H. Suppose O is a quasi-free Fermi 
automorphism as in Example 7.1.2.3; let V : Hp + Hp be the unitary 
operator that implements it on the Fock space. Since the number oper- 
ator is left invariant by quasi-free automorphisms, if V belonged to Ap, 
then it should also belong to the even Fermi algebra Ag. Further, from 
asymptotic Abelianess, the invariance of the norm under unitary trans- 
formations and the fact that the various V; commute, it turns out that, 
for any € > 0 and X € Ap, 


IM, OsfX] JI] = IVI V: X Vs — Vi X V: Val 
= (Vix - XV =X - Vİ XVI <e 
for allt € R. This cannot be true for all X € Ap so that the unitary oper- 


ator V € B(H) does not belong to Ar. However, since Ap is irreducible 
(see (7.12)), V belongs to the bicommutant A% p = {ACF = B(Hp). 


. Very rarely, starting from the Hamiltonian of a system of N interacting 


particles and going to the thermodynamic limit, one obtains a norm- 
continuous dynamics at the C* algebraic level, that is independently of a 
given time-invariant state. Usually, the dynamics exists only in the GNS 
representation provided by that state; however, an instance of Galilei 
invariant interaction which gives rise to a norm-continuous group of au- 
tomorphisms of the CAR algebra can be found in [300]. 
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Example 7.1.6 (Infinite Dimensional Quantum Cat Maps). 

The finite dimensional quantization of the torus T? studied in Exam- 
ple 5.4.2 can be turned into an infinite dimensional one by lifting the condi- 
tion (5.86), namely the quantum counterpart of the folding constraint (2.15) 
in Example 2.1.3. Concretely, the Weyl relations (5.83) become 


Uys? = ett iO nina ye UF , 0 E€ (0, 1) : 


where U and V are two abstract unitary operators and n = (nj,n2) € Z?. 
Notice that 26 plays the role of 1/N in Example 5.4.2 and is a continuous 
deformation parameter: when 0 = 0, the commutation relations are those 
that hold for the exponential functions (2.21), namely 


Enem = men - 
Then, as in (5.84), we define the unitary Weyl-like operators 
Wan) := e7” 7 mn UN V, nel; 
that satisfy relations similar to those in (5.85), 
Wo(n) Wo(m) = e” 7em) Wi(n+ m), Vn,meZ?, (7.29) 


with symplectic form o(n,m) := nM — nm. Also, in analogy with (7.11), 
one sets 


Wolf) = X f(r) Wo(n) , (7.30) 
where | f) = {f(n)}nez2 belongs to the subspace 4, (Z?) C (Z?) of square- 


summable sequences with finitely many non-zero components. We shall call 
support of f the set 


Supp(f) := {n EZ: f(n)F o} (7.31) 
The following properties hold for all f, g € @,(Z7), 
Wo(f)' =W), Fin) = f(-n)* 


Wo(f)Wo(g) = Wo(f *g), with (7.32) 
(fx g)(n):= So er m f(n — m)g(m) . (7.33) 
mez? 


Consider the *-algebra Aj := {Wo(f) : f € €*(Z?)} generated by all possible 
linear combinations of Weyl operators Wo(f) with f € ¢*(Z?). Let then w 
denote the linear functional w : Aj + C such that 


w(Wo(n)) = dno - (7.34) 
Using (7.32) with (7.33) one checks that 
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w (Wolf)? We(g)) = (FT =) fm =(flg). (7.35) 


meZ? 


Thus w is a positive normalized functional, namely a state on Aj similar to 
wy in (5.187). Consider the associated GNS representation 7,,(Aj) and set 


If)o:=mo(Wo(f))|2) Vif) ee (Z), 


where | 2,,) is the cyclic GNS vector. Then, the vectors |n )g form an ONB 
in Ho and 9{n|f)o = (n|f) = f(n). Therefore, Ho = £?(Z?) = L2, (T?), 
independently of the deformation parameter 6; also 


To(Wolf))l9) =|f*9) - (7.36) 


The *-algebra Aj can be equipped with a *-automorphism which extends 
to the present case the dynamics discussed in Example 5.6.1.4: it is defined 
on the Weyl operators by 


O4[Wo(n)] = Wo(AT n) , (7.37) 


where A is a 2 x 2 integer matrix as in (5.176). The state w is left invariant by 
Ox, which is then implemented by a same unitary operator U,, for all 0 € [0, 1) 
that coincides with the Koopman operator Ua of Example 2.1.3. Indeed, 


=J f(n) Wo(ATn) = Aa) ) Wo(p) = Wo (Ua f) , 


(7.38) 
for f(A7Tn) = (Ua f)(n) (compare (2.1.3)). Consequently, 


w (We(f)Oa[Wo(9)]) = (2| m(Wo(f))! Uo m(Wo(g)) |2) 
= w(Wo(f)Wa(Uag)) = (f |Ua |g). (7-39) 


The dependence on 0 € [0, 1) emerges when considering the closure of Tu (4%) 
with respect to strong-operator topology thus obtaining von Neumann sub- 
algebras Mọ C B(HL,,). 

While the GNS Hilbert space and the unitary implementation of the dy- 
namics are the same for all von Neumann dynamical triplets (Mo, O,,w) 1}, 
for 0 = 0 Meg is isomorphic to the maximally Abelian von Neumann algebra 
L3°.(T?) of essentially bounded functions on T? (see Section 5.3.2), for 6 ir- 
rational Mog is a hyperfinite JJ; factor, while Mg is not a factor and of finite 
type In when @ is rational. 

Let us consider the case 0 = 0; clearly, Mo is an Abelian von Neumann 
subalgebra of B(H,,), actually, maximally Abelian since it has a cyclic vector 
| Ru} (see Example 7.1.4.1); therefore, it is isomorphic to LÆ (T?) via the 
argument of Theorem 5.3.3 and a one-to-one mapping 


16, and w denote the extensions of the automorphism Oa and of the state w 
from Aj to the strong-operator closures Mg. 
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en > Bjen] = Wo(n) (7.40) 


between the Weyl operators Wo(n) and the exponential functions (2.21). 
Indeed, one observes that the multiplication of vectors in L3,.(T?) by en 
and the action (7.36) of Wo(n) on vectors in He do coincide. We shall thus 
identify Mo = LÆ (T°). 

Consider now the case of a rational deformation parameter, 6 = p/q, 
p,q € N; then, (7.29) yields 


W/q(qn)Wyjq(m) = e?r ipon) (art +m) = Wpja(Mm)Wp/alan) , 
for all n,m € Z?. Further, set 
Z? > n = (m,n2) = [n]+ < n >:= ([nm]+ < mi >, [n] + < n2 >), 


where, for any n € Z, [n] = qm denotes the unique multiple of q such that 
0<n-—[n] =:< n >< q-— 1. Then, one gets 
Wojq(™) = Woyq(r]) Wpya(< n >) . 


As a consequence, when @ is rational, every Wp/q( f) can be written as 


Waef= do ‘s f([n]+ < n >) Wpyq(lr])) Wyya(<n >) 


<n>EJ(q) [n] 


= So X5(s8)Wyyq(s) with (7.41) 
seJ(q) 
= X f(qn+s) Wyyjq(an) E MO , (7.42) 
neEZ? 


J(q) := fs = (81,52) :0< s; <q- a and where 
M = { Ý f(r) Wojqlar)} (7.43) 
neZ? 


denotes the von Neumann subalgebra of M,/, linearly generated by the Weyl 
operators of the form W,/4(qn), n € Z?. Because of (7.41), they commute 
with Mp/q whence M(® belongs to the center of Mop /q- 

Moreover, the exponential functions of the form Wo(qn) fulfil 


Wo(qn)(r) = Wolan)(r + s/q) Vse J(q). (7.44) 


They generate a *-algebra whose strong-closure is a von Neumann subalgebra 
Mo) C Mo of essentially bounded functions f on T? such that 


f(r) = WLP) = f(r + 8/9) , (7.45) 
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for all s € J(q). Let Hs : Mo + MẸ be defined by 


Mo > f = Ilf] := Y flan +s)Wo(qn) € MẸ , (7.46) 


nEZ 


where s € J(q). By decomposing 


f= >> Fm) Woln)= Y (X Sm + 8) Wolam)) Wols) , 


n seJ(q) m 


it follows that I, can be recast as 


Tlf] = Wols) D Ierta, (7.47) 
te J(q) 


Furthermore, a map similar to the one in (7.40), 
Wo(qn) > &,[Wo(qn)] = Wpqg(an) Ynez’, (7.48) 


makes M( and Mo isomorphic so that (7.42), respectively (7.41) read 
X;(s) = ®,[Ls[f]], respectively 


Woiglil= XO Sale Ways): (7.49) 
seJ(q) 


Concluding: 1) My q is not a factor, 2) due to the finitely many non- 
commuting W,/¢(s) with s € J(q), the type of Mp/q is finite In (see the 
discussion of types preceding Section 7.1) and 3) Mp/q is hyperfinite for such 


is Mo (and thus M\”) according to Remark 7.1.1. 
For @ irrational, Mg is a factor; indeed, from (7.29), 


[Wo(n) , Wo(m)| = 2i sin(27 0o (n, m)) Wo(n + m) (7.50) 


cannot vanish for n # m, whence the center Z = Mg N Mọ is trivial, that is 
it consists of multiples of the identity only. Since the state (7.34) is a trace 
on Mg, according to the discussion following Definition 7.34, Mg is a type 
IT, factor and also hyperfinite [263]. 


In the commutative setting of Example 2.2.3, the unitary U, corresponds 
to the Koopman-von Neumann operator which cannot belong to the commu- 
tative von Neumann algebra Mo = L% (&). 

As much as in this case and unlike for finite level quantum systems, the 
quantum dynamics is typically implemented by unitary operators which map 
the algebra of observables into itself, indeed 


Ui (0) m(X) Ua(t) = mo(O(X)) € molA) , (7.51) 
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without U,,(t) itself belonging to tu (A). Given 7,,(A) and U,,(G), one can 
however consider all linear combinations of products of operators in 7,,(A) 
and elements U,,(t) which can always be reduced to the form (compare Ex- 
ample 5.3.2.7 for a similar structure) 


XO To(Xi) Un) . (7.52) 


icl 


These elements form an algebra, denoted by {mu (A), Us (G)} whose bi- 
commutant, namely its strong closure on H, turns out to be a useful tool to 
discuss ergodicity and mixing in quantum dynamics. 


Definition 7.1.5 (Covariance Algebra). Given a quantum dynamical sys- 
tem (A, O,w) and the GNS implementation of the dynamics, the associated 
covariance algebra is the von Neumann algebra Ru := {Tu ( A), Uu(G)}”. 


As we have seen in Section 5.3, besides the von Neumann algebra Ra itself, 
what is also important is its commutant; in particular, in the framework of 
the GNS construction, for what concerns the convex decompositions of the 
reference state w (see Remark 5.3.2.3). As regards the covariance algebra 
Ra and its commutant R!,, notice that if X € B(H,,) commutes with Rw, 
it must commute with both m, (A) and U,,(G). Vice versa, if X € B(H,,) 
commutes with 7,,(A) and Us (G), by continuity, it also commutes with the 
von Neumann algebra generated by them. Therefore, 


Ri, =m.(A)' N U,(G)’ , (7.53) 


where U,,(G)' is the algebra of the bounded constants of the motion, that is 
the algebra of all X € B(H,,) such that U,,(t)' X U.,(t) = X. 


Example 7.1.7. For the case of Example 5.6.2 the covariance algebra is 
n T ) n 
Ry = (mp(Ma(C)) U U,(R)) = (Ma(C) @ 2 U p* @ 0) 
= Mo(C) 8 {12,03} , 


where {12,03} stands for the commutative algebra of 2 x 2 matrices which 
are diagonal in the eigenbasis of p. Furthermore, the constants of the motion 
are contained in U,(R)’ = {12,03} ® {12,03} and 


Ry = Tp(M2(C))’ M U,(R)’ = lly ® {12,03} z 


7.1.3 Quantum Ergodicity and Mixing 


As seen in Section 2.3, ergodicity corresponds to a specific behavior of the 
time-averages of two-point correlation functions. Given a quantum dynamical 
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triplet (A,O,w), two-point correlation functions have the form w(AQ@;(B)) 
where A,B € A and t € G, where G = R or Z. For sake of concreteness 2, 
we will consider averages or invariant means as in (7.3), namely of the form 
(compare Definition 2.3.1) 


T-1 


m [o(A1,(B)O)] = fim = Yo AOB) (T54 
t=-—T 


in discrete time, or else 


m [w(A'O,(B)C)| = Jim T dt w(A'O,(B)C) if G=R. (7.55) 
— 00 -T 


Because of (5.49), it turns out that these averages are bounded, 


mt [w(Ate,(B)C)]| < ||AlBI NC 
Also, in the GNS construction based on the invariant state w, the averages 
ne [w(ATOx(B)C)] = m [( 2. |Tu(A ULE) m.(B) Ua (t) TlO) [Qu] 


provide bounded sesquilinear forms on a dense subset of the GNS Hilbert 
space Hl. This observation together with an argument similar to that in 
Example 5.3.2.3 lead one to introduce 


1. an operator 7, [Uu] € B(H..) with matrix elements 


(Y | mw [Uu] 0) = m KY Uot lp] Vy, EH ; (7.56) 


2. a linear map 7, : At> B(HL,,) defined by 


(| nw [A] 1) = m [( | UE E) (A) Uolt) |6)] Yy, EHe. (7.57) 


Notice that, because of O-invariance, w(n, [X]) = w(X). 


Examples 7.1.8. 


1. Consider a finite-dimensional dynamical system described by a finite- 
dimensional Hilbert space H = C4, by observables that are matrices in 
Ma(C) and by a Hamiltonian H assumed to have non-degenerate eigen- 
values eq > €a-1 > ...e9 = 0 and eigenvectors |j), j = 0,2,...,d—1. 
The dynamics is thus given by (see (5.170)) 


?For more details on the existence of invariant means see [107]. 
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d—1 
U,=e "7 =|0)(0] + Soe? "|G )(5 
j=l 


d—1 
XH X= UXU = YO eM) (kL XZ) LRM, 
j,k=0 


for all X € Ma(C). Then, the time-average 77 yields 


mU)=|0)(0], AX) = > IX) ING. 


Thus, 7: Ma(C) + Ma(C) amounts to the conditional expectation (see 
Example 5.2.9.1) onto the Abelian subalgebra of Ma(C) generated by the 
minimal projections |j )(j |. 


2. All the eigenvectors |j} in the previous example provide, U/;-invariant 
expectation functionals wj on Ma(C). The corresponding irreducible GNS 
representations 7.,,(Ma(C)) (see Section 5.5.1) act on a Hilbert space 
isomorphic to C? with GNS cyclic vectors of the form | 2;) =| j) @|j). 
The dynamics is implemented by unitary operators of the form 

Us, (t) = e tH Q eites la , 
so that they preserve | 2; ). It follows that nu, [U..,] = | 2; )( 2; |, while 
no, [X] = (X) @ | 7)( | for all X € Ma(C). 

3. Consider the projection P, onto the subspace of vectors | w) € Ho such 


that Us| Y) = |w) (the GNS vector | 2u ) is certainly one of them). Then, 
since w, o € Ha are arbitrary, 


(P| nae [U] Po |o) = Te Ky lUs(t) Polo) = (y| Polo) 
implies ne [Uo] Po = P.; analogously, Po nu [Uu] = Pu. Moreover, 


(Y | Uu(5)Nu [Uu] 16) = mel | Uot + s) 1¢)] = (Y | nw [Uu] lE), 


whence 7, [Uu] |) is invariant. Thus, Pons [U.]|¢) = ne [Un] |) for 
all ọ € Hu and then nw [Uw] = Po No [Uu] = Po. 

Notice that nu [Uu] belongs to Rw, for it arises from averaging correlation 
functions; furthermore, since it equals P,,, it does not depend on the 
specific invariant mean used. 


While the average of the time-evolution U,,(t) gives rise to the projec- 
tor onto U,,(t)-invariant vectors (compare Proposition 2.3.4), the average of 
quasi-local observables A € A transforms them into global constants of the 
motion, namely into observables that belong to the strong-closure of A and 
that are left invariant by the dynamics. 
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Proposition 7.1.3. 7, [A] C Tu A)” N U.(G)’. 


Proof: We check this on the dense subset 7(A)| Ru ) C Hu. Let X belong 
to A and X’ to the commutant 7,,(A)’, then, using (7.51), 
(Rw [Tol AİN [X] X'm1.(C) | Qu) = 

= [( Qu | Tol A) To (Or(X)) X’ TlO) [Qu )] 

=m [( Ru | Mu( A)! X’ m(Ox(X)) ToC) [Qu )] 

= (Ro |T (A) X’ no [X] TlO) | Qu) . 
Thus, nu |X] commutes with mu (A)! whence nu [A] E Tu ( A)”. Similarly, 

) 


(Ro | Tol A No [X] Uwls)Tu(C) | Qu) = 
= mh [( 2. | m(A)! UL ETX) U(t + 8) Tu (C) [Qu )] 
= me [( Qu | m(A)! Ua (s)UL(E + 8)m(X)U(t + 8) TlO) |2 )] 
= (Ru | m,(A)TUL(8) mo [X] ToC) Qu) 5 


for all X € A and s € G, whence 7,,(X) € U.,(G)’. 


oy AA 


Example 7.1.9. We have just showed that X € A => no |X] € U.,(G); 
also, by construction (see Example 7.1.8.3), Po = nw [Uw] E Uu(G)”. Then, 
it follows that [n, [X] , Pu] = 0, whence 
(Ru | (A) No [X] Po ma, (B) |22) = 
=e K Rw | T( A)! Po Tw(Oz(X)) Pu Tw(B) [Ru )] 
= (22, | Tu(A)' Po Ta (X) Py Tu(B) [Qu ) 


on a dense set. Therefore, for all X € A, it holds that 


Nw (X| Pa = Pono X] = Poal X) Pi VXEA. 


In the case of classical ergodic systems, these latter systems are equiv- 
alently identified by the clustering properties of their two-point correlation 
functions (Propositions (2.3.2) and 2.3.9), by the spectral properties of the 
corresponding Koopman operator (Corollary 2.3.1) and by the extremal- 
ity of their invariant states (Proposition 2.3.8). Clustering in the mean as 
n (2.65) or (2.75) and extremality are notions that readily extend to the 
non-commutative setting. 


Definition 7.1.6. A quantum dynamical system (A,O,w) is n-clustering if 
m [w(A@,(B))] =w(A)w(B) YA,BEA, (7.58) 


clustering if 
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lim w(AO,(B)) =w(A)w(B) YX,Y EA. (7.59) 


A state v on A is extremal if v = Avı + (1 — A)v2 with O0 < A < 1 and 42 
states on A implies v1.2 = v; v is an extremal O-invariant if it cannot be 
written as a convex sum of other O-invariant states on A. 


Remarks 7.1.5. 


1. If m,(A)” A Us(G)) = {A1}, where {A 1} denotes the trivial algebra 
consisting only of multiples of the identity, then the global constants of 
the motion are trivial. It follows that (A,O,w) is 7-clustering. In fact, in 
such a case 7, |B] = w(B) 1 for all B € A, whence 


mt [w(A@(B))] = (2o | to(A)ne [B] | Qn) = o(A)o(B) . 


2. Suppose the invariant state w in (A, O,w) is not extremal invariant; then, 
there exists a state v on A such that Av < w, for some 0 < A < 1, and 
vo O = v. Thus, from Remark 5.3.2.3, Av(X) = (2, |T'Tu(X) |Q,) for 
all X € A, with 0 < T’ E€ m,(A)’. It turns out that T’ € U(G)’, too; 
indeed, O-invariance yields 


(Ros [TATT Us (t) To (B) | Qu) = (2a | T(A)! T Tw(O_+(B)) | 2u) 
= Av(At O_4(B)) = v(O;(A)! B) 
= (Ro | T(A)t Us (t) T’ Tw(B) [2u ) ; 


on a dense set. Therefore, T’ € R',. 
Consider now the following list (we shall refer to it as ergodic list in the 


following) of statements [107] concerning clustering, extremal O-invariance 
and the spectral properties of the dynamics of quantum dynamical systems. 


w is extremal invariant (7.60) 
R! = {AL} (7.61) 
Ru N R! = {AL} (7.62) 
(A,0,w) is 7-clustering (7.63) 
Py is a one-dimensional projector (7.64) 
Tul A! A RL = {A1} (7.65) 
no [X]=w(X)1 YXEA (7.66) 
w isthe only normal invariant state on 7,,(A)” (7.67) 
| 2. ) is the only invariant vector state in m,(A)|Q.,) . (7.68) 


From Remark 7.1.5.2 it follows that (7.61) > (7.60); also, if RJ, is not trivial, 
then, as in Remark 5.3.2.4, w can be decomposed into a convex combination 
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of 9-invariant states. Therefore, (7.60)< > (7.61). If w is not extremal invari- 
ant, the corresponding operator 1 4 T’ € R’, provides an invariant vector 
state U,(t)T’| 2u) = T'| 2), thus | Ru) is not the only invariant vector 
state and the projector P, onto the invariant subspace of H, cannot be 
one-dimensional, whence (7.68)=— (7.60) and (7.64)= > (7.60). 
Furthermore, using Example 7.1.8, one gets 


me [w(AO:(B)] = (Ru | (A) ne [Uv] ToB) | 2. ) 
= (2,| (A) Po mu (B) | 2o) ; 


together with P,,| Ru} =|, ) and 
w(A)w(B) = {Ro | Tw (A) | Rw )( Ru | ToB) | Qu) ; 


this yields (7.64)<=> (7.63). 

From Proposition 5.5.2 it follows that the normal invariant states p on 
Twu( A)” must correspond to density matrices p € BY (Ho) that commute with 
U..(G); thus, p(mu [X]) = p(m(X))) for all X € A and, if ns |X] = w(X) 1, 
then p(n. [X]) = p(m(X)) = w(X), whence (7.66)= > (7.67). Also, from 
Remark 7.1.5.1, (7.66) > (7.63). 

Other implications are (7.61)= > (7.62)= > (7.65) and (7.67)==> (7.68). 


Summarizing, 


Proposition 7.1.4. With reference to the previous list of ergodic properties, 
the following implications hold 


(7.64) <> (7.63) <= (7.66) => (7.67) 
4 4 
(7.65) <= (7.62) <= (7.61) <= (7.60) 4— (7.68) 


While in a commutative setting the properties (7.60)— (7.68) are equiva- 
lent and each of them identifies an ergodic system, it is not so in the quantum 
realm, unless (A,O,w) is asymptotic Abelian [107, 300]. 


Definition 7.1.7 (Asymptotic Abelianess). Suppose (A, O,w) enjoys one 
of the following properties 


m [w(At [B, O:(C)] D)] =0 VA,B,C,DEA (7.69) 

jim w(At[B,O,(C)]D)=0 YA,B,C,DEA (7.70) 

lim w(At [B, O,(C)|'[B,O,(C)]A)=0 YA,BEA (7.71) 
lim ||[B,@:(C)]||=0 YB,CEA. (7.72) 


In the first case, (A,O,w) is called n-Abelian, in the second one weakly asymp- 
totic Abelian, in the third case strongly Asymptotic Abelian and in the last one 
norm asymptotic Abelian. 
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Physically speaking, asymptotic Abelianess corresponds to the fact that 
the non-commutativity of any pair of local observables is a property which 
dies out asymptotically under the action of the dynamics on one of them. 


Example 7.1.10. Quantum spin chains (see Example 7.1.1) are the simplest 
instance of norm-asymptotic Abelianess with respect to lattice-translations 
on the quasi-local C* algebra A [277]. In the case of spins 1/2 at each site, 
lattice-translations correspond to the shift-automorphism Os : At A de- 
fined by 


n 


ie det] 
Hoi] <The" 


Given A, B € A, for any £ > 0 we can find strictly local A+, Be € Aj_x,x) such 
that |A — A-|| < £ and ||B— B,|| < £. If NS n > 2k, OF[B] E€ Amn-k,n+k] 


commutes with A., |A, on [Bel] = 0, whence 


|[4- ert] < | [4-4 s] + | [+e ent - l| 
< 2e| Bll + 2e(e + [IAll)- 


The mean magnetization m in (7.3) is not a quasi-local observable for the 
spatial-average collects contributions from all local regions: nevertheless, it 
exists in the GNS representations 7}, (.A)” corresponding to the translation- 
invariant states w;,, in (7.1) and (7.2). Furthermore, local non-commutativity 
is suppressed by dividing by larger and larger number of sites with the result 
that the mean magnetization commutes with all local observables. By its 
very construction it also commutes with the GNS unitary operator U„ which 
implements the space-translations on the GNS Hilbert space. Therefore, m € 
Ry = mM (A) NU (Z)’. Since 74, (A)! = {Al} it thus follows that m is a 
scalar multiple of the identity in the two representations. 


The fact that the mean magnetization commutes with all local observables 
holds, more in general, as a consequence of 7-Abelianess; namely, nu [A] C R! 
follows from observing that (7.69) yields 


Remark 7.1.6. Proposition 7.1.3 states that nu [A] © m,(A)” 9 U..(G); if 
(A, O,w) is 7-Abelian then also nu [A] C m,,(A)’, whence no [A] C Ru ARI. 
Since the latter is an Abelian von Neumann algebra, actually the center of Ru 
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(see Definiton 5.3.4), it turns out that [ne [X] 5 nw bal = 0 for all X,Y € A. 


Therefore, using Example 7.1.9, one proves that on a dense set, 
(24| mu(A)* [Pa mo(X) Pas Pars to(¥) Po] tuo B) [Qu = 

=m [( Qu mu(A)t [Pe a(X) Pos Turul) Pal mB) [20 | 

=m (Olata [Pa mo(X) Pa Mo(Oe(¥)) Pal to(B) 120 | 


= (2 tA)! | Potta(X) Pas tha LY] Po] tol B) 2a) 


=m [( Qu | mu(A)t [mu (Ox(X)) Pa sme FT Pa] mo(B) [20 )| 


= (2, | m(A)t Pe [ro em [YT] Pas mu(B) Qs) =0 , 


for all X,Y € A. Therefore, 7-Abelianess implies Abeliannes of Po nu (A) Po 
as a set (in general it is not an algebra). 


If (A, O, w) corresponds to a classical dynamical system, then (7.70), (7.71) 
and (7.72) are equivalent statements implying (7.69). In a genuinely quantum 
setting, however, (7.71), (7.70) and (7.72) take into account the differences 
between convergence in the weak, strong and norm topologies, whereby, in 
general, (7.72) (7.71) (7.70) (7.69). Interestingly, the weakest sort 
of Abelianess is sufficient to make properties (7.60)— (7.68) equivalent to each 
other. Indeed, the consequences of 7-Abeliannes are as follows. 


Proposition 7.1.5. If (A,O,w) is n-Abelian, then R!, = m,(A)” AN Us(GY. 


Corollary 7.1.1. If (A,O,w) is n-Abelian, the properties (7.60)— (7.68) in 
the ergodic list are equivalent. 


Proof: Because of 7-Abelianess, applying the previous proposition one gets 
that (7.65) > (7.66), whence the claimed equivalence follows from Proposi- 
tion 7.1.4. 


Then, the following definition makes sense. 


Definition 7.1.8. An 7-Abelian quantum dynamical system (A,O,w) is er- 
godic if w is extremal invariant. 


Remark 7.1.7. In the case of Example 7.1.8.2, the eigenvectors of the Hamil- 
tonian are extremal as functionals on Ma(C) and thus ergodic according to 
the definition above. However, finite-dimensional systems cannot be asymp- 
totic Abelian; indeed, while condition (7.64) holds, condition (7.66) does not. 


7.1 Observables, States and Dynamics 349 


Proof of Proposition 7.1.5 The proof is based on 3 steps. 
e Step 1: If (A,0,w) is 7-Abelian then P, Ro P, is Abelian. 


Indeed, from Remark 7.1.6 we know that P,,7,(A) P, is Abelian, thus, 
by continuity, also P, 7,(A)” Ps. On the other hand, since the elements 
of Ru are strong limits of operators of the form (7.52), it turns out that 
Po Tu (A)" Py = Po Ru Pu, whence the latter is Abelian. 


e Step 2: If P, R. P, is Abelian, then R’, is Abelian (R/, C R! = Ro). 
Since P, E€ Ru, we can use Example 5.3.2.6 to deduce that 


P, Ro Py C (P, Rw Pu) = PR, P, - 


Further, since | Qu y is cyclic for Py Ru Pu with respect to the Hilbert space 
PH, using Example 7.1.4.1 we conclude that P, Rue Po = P, Ri, Po, 
whence the latter algebra is Abelian. In order to prove the statement, 
we show that R’, and P, Ri, P., are isomorphic; for any X’ € R’/, set 
A(X’) = P, X’ Ps. This map is obviously linear and surjective; further, since 
Po E€ Ru, XY’) = ACX')A(Y’) for all X’,Y’ € R!,. Finally, suppose 
A(Z’) = 0 for some Z’ € R!,, then, since P,,| Ru) =| Qu), 
Z' Tol X)| Ro) = Ta(X) Z P| Q) = ntu X) Po Z' P| Qu) =0 

on a dense set. Thus, Z’ = 0 and A is also injective. 


e Step3: IfX € Aand7,(X) E m(A)N UL(G) then 7, [X] = mu(X) € 
Tul AY. 


This fact can be extended by continuity to the constants of the motion in the 
strong closure Tu ( A)” N Us(GY so that 


m(A)" N USCG) © Tol AY => To (A)! N Us(GY CR! . (7.73) 


If X € m,(A)” O U..(G)’, it commutes with the projector onto the invariant 
vectors P, E Ru, whence X P, = P, X P, implies 


(m(A)" n U.(G)') P, C Pym(A)" Po. 
On the other hand, Example 7.1.9 and Proposition 7.1.3 yield 
Nu [A] = Pa Tu( A) Po © Tul A)” N Uu(G)' , 
whence by continuity 
Po To (A)" Py = (mu(A)” n U.(G)') Ps. (7.74) 


Finally, from the first two steps, (7.73) and (7.74), we derive 
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RL, P, = Py R!, P, C Py Ry Py = Py ty(A)"” P, 
= (ata o v.(G)') P,= (nAn v.(G)') Pa: 


Consequently, given X’ € R!, there exists Y’ € m,,(A)” O U.(G)’ such that 
(X — Y’)| 2,,). As | 2u) is cyclic for Rw, it is separating for R!,, thence 
X’ = Y' so that Ri, C m,(A)” or, equivalently, R!, C Tul A)” N U.(G) 
which, together with (7.73) completes the proof. 


Remark 7.1.8. In case of 7-Abelianess, from R’, = m,(A)” O U..(G)’ it fol- 
lows that R’, is contained in the center Zy = 7,,(A)” N Tul A) of Tu (A) 
and is thus Abelian. Actually, R’, coincides with the commutative algebra 
of Q-invariant classical observables of the quantum system (A, O,w). There- 
fore, if (A,O,w) is 7-Abelian and w is a factor state (see Remark 5.3.2.2), 
then R!, = {Al} and w is extremal invariant and thus ergodic, according to 
Definition 7.1.8. 

If (A,O,w) is 7-Abelian, but the state w is not extremal invariant, that 
is not ergodic with respect to O, then R/, cannot be trivial. However, it is 
Abelian and thus generated by a unique set of minimal projections P; (see 
Section 5.3.2). Each of these projections are such that (U})t P; U£ = P; and 
thus provides a O-invariant state wj on A: 


(Ro | Pj m(X) Ru) 
w(P;) l 


w;(X) = 


Since the P; are minimal projections in R/,, they cannot be further decom- 
posed in R!,, whence they are extremal invariant and yield a decomposition 
w = >), w(Pj) w; of w into its ergodic components. 


Example 7.1.11. [220] Let w+ be states of a one-dimensional spin chain of 
the form (7.4), where (the upper indices label the lattice sites, the lower ones 
the Pauli matrices) 


ds (Tres) - je (=e Mos ou) =] 


- (Iles) = e ( "os ci) =] S14 
l=1 m 


7E 


na T=) 


Unlike the states (7.1) and (7.2) which are characterized by infinitely many 
spins pointing up, respectively down along the z axis, these states are anti- 
ferromagnetic alternating spins up, at even sites w+, at odd sites w_, and 
spins down. These are pure states and give rise to factor GNS representations 
Tw(A)”: in fact, as for the states (7.1) and (7.2), one can show that the 
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commutants 74(A)’ are trivial. Also, by considering the shift-automorphism 


of Example 7.1.10, it turns out that w+ o Os = w+ so that the state 


w+ + w_ 


- (7.75) 


is translation-invariant and can be decomposed in terms of pure states: 


= ( Ro | Q47(A) Ru) 
VEE OO 


with Q+ € m,(A)’. If w could be decomposed into Os-invariant states wi, 
these would correspond to 0 < Q; € Ri, = Tul A)! N Us(G)' that provide 
decompositions of the pure states w+ as well 


(Wy | Q+ Qi (A) [2 ) +4 (Wu | Q+ (1 = Qi)T(A) | 2.5) 
(Ro |Q |u) (Ru | Qt |Qu) 


which is impossible. Therefore w is extremal invariant, but not a factor state. 
As for the spin system considered at the beginning of this chapter, the follow- 
ing even and odd magnetizations M®? = (mi? , mg’, m3’) (see (7.3)) exist 
as strong operator limits in the GNS representations 74 and mTw: 


w+4(A) = 


N N 


e yk H 2i o y H 2i+1 
a A Nl 2s o ma N al pins i 


They commute with all local observables and thus belong to the trivial centers 
Z+ and the non-trivial one Z,,. In the first two ones they are multiples of the 
€ oO 


identity, m4 = (0,0, +4) and m% = (0,0, Fu), while in the representation 
Tu Which can be conveniently split as my (A) = 74(A) ® x_(A), 


neden Sy). adela 9) - 


Furthermore, while enjoying clustering in the mean, the representation Tw, 
which is not a factor, is not clustering; indeed, 


(1) + (1) 
2 


—1)/f _1\e 
w(aiestt) = CY “ sea =i, 


From the previous discussion, it turns out that if (A,O,w) is 7-Abelian 
and w extremal invariant (property (7.60) in the ergodic list), then two- 
point correlation functions factorize in the mean (property (7.63) in the er- 
godic list). Concerning the extension of the classical property of mixing (see 
Proposition 2.3.3, (2.66) and (2.76)) to quantum dynamical systems, due 
to non commutativity, one distinguishes between various way of clustering 
beside (7.59). 
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Definition 7.1.9. A quantum dynamical system (A,O,w) is weakly mixing 


if [42] 
5m w(A@:(B)C) =w(AC)w(B) VA,B,CEA; (7.76) 


strongly mixing [42] (or hyperclustering in [215]) if 
i lim w(A@,(B)CO;(D)E) = w(ACE)w(BD) VA,B,C,D,E EA. 
7 (7.77) 


Clearly, strong mixing implies weak-mixing and weak-mixing implies 
n-Abelianess; it also implies weak asymptotic Abelianess, whereas strong- 
mixing implies strong-asymptotic Abelianess. The proof of the latter state- 
ment (the proof of the former one is similar) comes from (7.77) applied to 

w(At[@,(B) , C]'[@:(B), C]A) = 
= w((C'A)! ©,(Bl B)CA) — w((C'A)! ©;,(B)' C @,(B) A) 
—w(At 6,(B)' CT O,(B) CA) + w(At ©,(B)' CTC O,(B) A) , 


which yields 
Jim _w(At[0:(B), C}'[@,(B), C]A) = 


= w((C.A)! CA) w(Bt B) — w((C.A)' CA) w(Bt B) 
—w(At CT CA) w(Bi B) + w(AtCTCA)w(Bi B)=0. 
Also, weak-mixing and strong-asymptotic Abelianess together imply strong- 
mixing; this can be seen by rewriting 
w(A'[6,(B), C]'[6,(B), C]A) = 
= w((C'A)! ©,(B! B)C A) — w((CA)' O,(Bt B)C A) 
—w(Al @,(Bt B) CÌ C A) + w(A' CTC ©,(BT B) A) 
-w((cayt ,(B") [ct 0,(B)| CA) = w(at 6,(B)t let, @,(B)|CA) 
+w(al @,(B)t [ct C, 0.(B)| A) l 


Now, weak-mixing means that the first four terms factorize in the same way 
and thus cancel each other, asymptotically in time; on the other hand, each 
of the terms with the commutators vanish asymptotically if the system is 
strongly asymptotic Abelian. This comes out from the Cauchy-Schwartz in- 
equality (5.49) which gives upper bounds of the form 


lo((c4)t aB) [ot @,(B)] c) Í < 


< w((CA)t ©,(Bt B) CA) w( (CA) let, 2,(B))' [ct 0,(B)| CA) 
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Proposition 7.1.6. Given a quantum dynamical system (A,O,w), we have 
the following implications: 


1. strong-mizing (7.77) implies weak-mixzing (7.76); 

2. strong-mizing (7.77) implies strong asymptotic Abelianess (7.71): 

3. weak-mizing (7.76) implies weak asymptotic Abelianess (7.70); 

4. weak-mixing (7.76) and strong-asymptotic Abelianess (7.71) imply strong- 
mizing (7.77). 


Remark 7.1.9. If the state w is faithful then weak-mixing is equivalent to 
the factorization of two-point correlation functions (see (7.59)). Of course, 
if (A,Qw) is weakly mixing, then w(X O;(Y)) asymptotically split into 
w(X)w(Y). Vice versa, using the KMS conditions (7.27), if (7.59) holds, then 


lim w(A O(B) C) = im wlow(i) (C) AO,;(B) 


= w(o.,(3)(C) A) w(B) = w(AC)w(B) . 


The same conclusion that (7.59) implies weak-mixing follows if one knows 
(A, O,w) to be weakly asymptotic Abelian; one uses 


w(A O,(B) C) = w(AC @,(B)) + w(A [ex(B) c]) 


The following proposition establishes a link, similar to the classical one, 
between mixing and the spectral properties of the time-evolution group 
U..(G) in the GNS construction. 


Proposition 7.1.7. If (A,O,w) is weakly mixing the following equivalent 
properties hold, 


Proof: Weak-mixing asserts that F lim 7,(O;(X)) = w(X) 1 on a dense 


set, whence (7.78). Furthermore, from 


w(At @,(B)) = (Ro |T (A) UL (t)m.(B)|Q.) and 
w(Al) w(B) = (Ro | m(A)"| Qa) Qu |tu(B) Qu) 5 


for all A, B € A, it follows that (7.78) and (7.79) are equivalent. 


Remarks 7.1.10. 
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1. If (A, O,w) is norm-asymptotic Abelian and w is a factor state, then w is 
clustering as in (7.59) [64]. Indeed, Z„ = {Al} says that the C* algebra 
B generated by m,(A) and 7.,(A)/ has trivial commutant B’ = {A1}, 
namely B” = B(H,,). Let X € A and consider the vector 


Ho > |x) = (m(X) - w(X)I)| 2) . 


It is orthogonal to | 2u ): thus, there exists B(H,,) 3 T = 7" such that 


TI¥x)=0, TQ) =|Qu). 
Actually, T can be chosen in B [64]: the operators 
Cy :=T (tm (X)—w(X)1), Co := (1- T) (m,(X) — w(X)1) 


are in B. Further, C,|2,,) = C| Ru) = 0 and m,(X) = Ci + C2 = 
w(X) l; consequently, 


w(XO,[Y]) — o(X)w(Y) 


(Ro | CT (OX) |2) 
= (Ru | [C1 , TOX] |20) - 


Now, for any £ > 0 one can approximate Cı € B by a finite sum in 


Tw(A) U Tul A): 


E 


lo-Zeor|s r 
j=1 


so that 


|w(XO,[Y]) — w(X)w(Y)| < 2 kes [ru(Cr), ru(4;)|B; (2 )| +E 


Thus, since (4, ©, w) is assumed to be norm-asymptotic Abelian, then it 
turns out to be clustering whence, according to Remark 7.1.9, also weakly 
mixing. 

2. If (A, O,w) is norm-asymptotic Abelian and w is an extremal KMS state, 
then it is a factor state (see Remark 7.1.3.4) and the system is weakly 
mixing. 


Example 7.1.12. The infinite dimensional systems of Example 7.1.6 provide 
an interesting framework to apply the previous abstract considerations [44]. 
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Since the O,-invariant state is tracial, as explained in Remark 7.1.9, cor- 
relation functions as those appearing in (7.76) can be reduced to two-point 
correlation functions as in (7.59). Then, (7.39) yields 


w (Wo( f)OnlWo(9)]) = (F1Ualg) - 


Therefore, if the underlying classical system is mixing, that is if the Koopman 
operator U4 has absolutely continuous spectrum on the subspace orthogonal 
to the constant function ( namely to the GNS cyclic vector | 2, }), then, for 
all f, g € L3, (T°), 


lim w (Wa(f)OLIWolg)]) = lim _(f1UK lg) 

(f° |2)(2|g) = w(We(f)) w(Wa(g)) . 
This means that, independently of the deformation parameter 0, the quan- 
tum dynamical systems (Mg,O,,w) are mixing when such is the classical 


dynamical system of which they represent a quantization. 
As regards strong mixing (7.77), observe that (7.50) implies 


D ([man. ƏL Wi(m)]] [Wo(n). 04 [Wolm)] = 


= 4Asin?(2700(n, (B)'m)) , (7.80) 


where we have set B = AT, the transposed of A. Since o(n,B'm) is an 
integer, when 6 € Q is rational, the right hand side of (7.80) is periodic 
in ¢ and cannot vanish when t — +oo. Therefore, the quantum dynamical 
systems (Mp/q, OA, w) cannot be strong asymptotic Abelian, and thus not 
strongly mixing, because of Proposition 7.1.6. 

If @ is irrational, then, as proved in [44], the right hand side of (7.80) 
vanishes asymptotically at most for a countable set of 6 € [0,1]. A concrete 
construction of a countable set of 6 is as follows [209]. 

Let t > 0; using (2.17)— (2.19) in Example 2.1.3 with b and c exchanged, 
one explicitly computes 


a(n, Bim) C (m)at (niaz = n2Q1+4) + C (mja ™(niaz— = n2a1—) 


™m1a2— — M201— a 
= a! [= (ni aa4 —n2a14) + O(a-*) 


= il ~ 02) (minila — a)(a — a) 


+monzb? — mın (a7! — a) — mono (a — a)) : 


The transposed matrix B = AT has eigenvalues a+! as A; therefore, Tr(B’) = 
at +a™ € Z for all t > 0. It thus follows that 
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a(n, B'm) (mını c — monab — (ming + Mmənı)a 


az—1 
+myngoa! + mony a) + O(a") 


2 


1 
ara 2ra + O(a) 
k=0 


2 
1 ; 
zi Sore Cas + oH) + O(a) , 
k=0 


where the coefficient rg € Z and thus also the sum is an integer. 
Choose 0 = a?s mod (1), s € Z, then 


t 


m) = s(a? — 1)o(n, B'm) mod (1) = O(a‘) mod (1) . 


do(n, 


Therefore, when t — +00, the function in (7.80) vanishes and norm- 
asymptotic Abelianess holds. Indeed, consider Wọ( f) and Wọ(g) where f 
and g have compact supports, namely, there exists K > 0 such that 
f(n) = g(n) = 0 when ||n|| > K; then, 


| [WaA) , On[Wolg a = a If(m)| |g(m™)| | , Wo(B'm)] | 


a a |f(m)| |g(™)| Crm (7.81) 


for a suitable constant Cn,m. Because of the assumption on Supp(f) and 
Supp(g), (7.81) goes to 0 with t > +oo. 

Thus, for 0 = sa? mod (1), s € Z, (Mọ, Oa,w) are weakly mixing and 
norm-asymptotic Abelian; whence, according to Proposition 7.1.6, strongly 
mixing. 


7.1.4 Algebraic Quantum K-Systems 


The notion of K-systems is naturally extended by removing the Abelian 
constraint from Definition 2.3.6. 


Definition 7.1.10. Let (A,O,w) be a quantum dynamical system; if A is a 
C* algebra it is called a C* algebraic quantum K-system if there exists a C* 
subalgebra Ao C A such that 


1. Ag: O [Ao] = Ati for allt € Z; 


2. Viez A; = ; 
3. Mez At = {Al}, 


7.1 Observables, States and Dynamics 357 


where Nicz At denotes the set theoretic intersection of the C* subalgebras A; 
and Vcg A: the C* they generate by norm closure. The nested sequence will 
be called a quantum K-sequence. 

If A is a von Neumann algebra acting on a Hilbert space, then (A, O,w) 
is a von Neumman algebraic K-system if there exists a quantum K -sequence 
of von Neumann subalgebras An, with \Vyez At denoting the von Neumann 
algebra obtained by strong-operator closure. 


Remark 7.1.11. Typically [220], starting from a C* algebraic quantum K- 
system with K-sequence A+, one considers the GNS representation 7,,(A) 
and the sequence of von Neumann subalgebras 7,,(A;) and checks whether 
it is a (von Neumann) K-sequence for (7,,(A)”, O,w). 


We have seen in Section 2.3 that classical K-systems enjoy the strongest 
possible clustering properties corresponding to K-mixing; to some extent 
this notion extends to von Neumann quantum K-systems. Let a quantum 
dynamical system (A, ©, w) posses a sequence {.A;}1ez of C* subalgebras such 
that, setting M := Tu ( A)” and Mi := Tul At)”, {Mi}tez is a K-sequence 
for the von Neumann triplet (M, O©,w). We assume w to be a faithful state 
and the Mo to be invariant under the modular automorphism og,,, so that 
Proposition (7.1.1) ensures the existence of a normal conditional expectation 
Eo : M+ Mo which respects the state. It thus follows that the CPU maps 
E := O, o Eo 0 O+: M+ M are w-preserving conditional expectations. 
Setting 

Pino ( A)| R2, = Eilr (A)| 2. ) VAEA, 


one obtains a bounded linear operator which can be extended to a bounded 
operator P; : Hu ++ H, where H; is the closure of the linear span Tu (A)| Ru ) 
and Ho is the GNS Hilbert space corresponding to w. 


Proposition 7.1.8. The P, are projectors such that P, = U} (t) P) U.(t), 
where U„(t) is the GNS unitary operator which implements O on Hu. If 
{Mihiez is a K-sequence, then [217] 


1: P< Paa for all t € Z; 
2.8— limp. 4.60 P = 1; 


3. s= limis P = | 2.5)( Qa |. 


Proof: From the properties of the conditional expectations (see (5.44) in 
Proposition 5.2.2), it follows that P? = P,. That P} = P, follows from the 
assumption that w o E; = w; indeed, using (5.42) and (5.43), one gets 


(Tu (A) Qu | Pits(B)Q.) = w( At E,[B]) = w(E:[A]! E[B])) = w(E[A]*) B) 
( Pentul A) Ro | 7. (B) 2u) ; 
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for all A € A (thus on a dense subset of Hu). Also, on a dense subset, it holds 
that 


Pit(A)| Qe) = O: fe} Eo O O_17.,(A)| Ra) 
= UL (t) Eo[U..(t)mu(A)UL (|| Ro) 
= Us) Po Us (t) Tu (A)| 2.) i 
This proves the second statement of the Proposition, while, of the last asser- 


tions, the first one is a consequence of M, C M1 and the last two relations 
follow from 


„lm (m(A)* |(P;- 1) [ro(B)) = „lim w( At (E[B] — B)) = 0 


lim (mu(A)*| (Pe —| Qu )( Mo) fro(B)) = lim w(At E[B]) - 
—w(A)w(B)=0. 


In fact, these two limits imply weak-operator convergence of projections to 
projections which is equivalent to strong-operator convergence. Notice that 
the second limit holds since E, maps onto the trivial algebra when t — —oo 
and w(F;[B]) = w(B). 


Corollary 7.1.2. Let (M,0,w) be a von Neumann algebraic quantum K- 
system as specified above; then, for any € > 0, Ao € Mo and A E€ M there 
exists T > 0 such that 


w(Ap@r[4]) — w(Ao)w(A)] < e y w(Ao AB) 
for allt < -T. 


Proof: The result is a consequence of the second strong-operator limit in 
the previous proposition and of [217, 300] 


w(Ao@[A]) = w(Ap Po Ut (t)A) = w(Ao UL (t) Pat A) , 
whence 
w(Ay@[A]) — w(Ao)w(A)| = |w (Ao U(E) (P; = 12 )( 2x1) A)| 


< (Ao Ap) II(Pe — | 2o X Ro |) A] Qu YI - 


Corollary 7.1.3. Let (M,0,w) be a von Neumann algebraic quantum K- 
system as specified above; then, it is weakly-mixing. 
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Proof: Because of Remark 7.1.9, one need show 


im w(AO,[B]) = w(A) w(B) VA, BEM. 


Since {M,}+ez is a K-sequence, let ¢ > 0 and choose Az € Mo and s € Z 
large enough such that 


|(A = O5[Ae])| Qu )| Se - 
Then, for t sufficiently large, 
w(A@_,[B]) — w(A)w(B)| < loa Gj (A.))0-.1B))| + 
+|u(4eO_(+5)[B]) - w(A)w(B)| < 2 IBI 


This shows clustering when t + —oo; when t — +00, one uses the modular 
automorphism to rewrite 


w(A@,[B]) = w(@_,[A]B) = w(o/,[B]0-1[A]) , 


w 


and then applies the previous argument. 


Remarks 7.1.12. 


1. The result in Corollary 7.1.2 is the maximum of uniformity one can 
achieve in clustering; indeed, if 


w(BO,[A]) — w(B)w(A)| < €\/w(B Bt) 
for allt < —T and B € M, then B = O,[A'] would yield 
0 = w(At A) — |w(A)|? = w((at —w(A)*)(A— w(A))) ; 


As w was assumed faithful, this gives A = w( A)1 for all A. 

2. By substituting Ag with O,[{Ao] for fixed s, the uniformity in Corol- 
lary 7.1.2 holds with respect to any fixed Ms. 

3. The structure of the nested sequence of Hilbert subspaces {H; hez corre- 
sponding to the projections P, very much resembles that arising from 
the Lebesgue spectrum of classical K-systems (see the discussion af- 
ter Remark 2.3.5). However, the projections {P:}zez have been con- 
structed by relying on the existence of w-preserving conditional expecta- 
tions E : M+ M. For a state like the tracial state which has trivial 
modular automorphism, they surely exist; however, this need not be true 
in general. Luckily, the previous results can also be proved without refer- 
ring to the existence of a K-sequence of projections [220]. 
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Example 7.1.13. As sketched in Example 2.3.3.4, the classical hyperbolic 
automorphisms of the torus are K-systems. Aided by this fact we shall 
show that, for rational values of the deformation parameter 6 = p/q, their 
quantized versions (Mg g,9,w) are algebraic von Neumann quantum K- 
systems [44]. 

According to Example 7.1.6, we shall identify the classical automor- 
phisms of the torus as triplets (Mo, O,,w), where Mo is the von Neumann 
Abelian algebra of essentially bounded functions on T?, O4 is such that 
Oalfl(r) = f(Ar), f € Mo, and w is the integration on T? with respect 
to the uniform measure dr. Therefore, the K-partition characterizing them 
as K-systems amounts to the existence of a K-sequence {M4 hez of von Neu- 
mann subalgebras of M (see Definition 2.3.6). 

Because of the decomposition (7.49), let M+ C M,/q be defined, with 
obvious use of the notation, as 


Me := XO Blw] Wojq(8) - (7.82) 
se J(q) 


In this way, the K-properties of the classical K-sequence {M; hez would 
make the characterizing properties in Definition 7.1.10 also hold for the non- 
commutative sequence {M }rez of subalgebras of M,/,. The first two condi- 
tions are in fact immediate, while the third one comes from the fact that (7.46) 
implies Ms[1] = ds 0. 

Unfortunately, one has first to ensure that the M; are subalgebras, namely 
that, if f,g € Nj, also 


SO GalFeLF] Ballo] Wo/q(8) Wp/alt) = 
s,teJ(q) 


= SO Sy {e[f] B, Mig] Wp/q(als + tl) Wojq(< s +t >) 
s,teEJ(q) 


DO [sl] Plg] Wolals +t)| Wpyq(<s+t>) (7.83) 
s,teJ(q) 
belongs to Mz. 


To this purpose, consider the von Neumann subalgebra Mo C Mo 
consisting of those essentially bounded functions on T? that satisfy (7.45). 


Since Mo is mapped into itself by O,, the K-properties of the sequence 
{Ni yez extend to the sequence {NO = MN MP Jez, whence the triplets 
(M\, @x,w) are also K-systems è. Further, let B denote the (von Neumann) 
subalgebra generated by the characteristic functions x 4s) of the partition of 
the torus into atoms 


i iti 
A(s) = fr : Ž < ris FE prakak 
q q 


3Here, O4 and w denote the restrictions of the dynamics and the state to Mo 
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t 
s=— 00 


where sin € J(q); set B; := Oa(B), By := V 
von Neumann subalgebras 


B,. Finally, construct the 


N, = NO V By , 


and consider the sequence {N hems 

This is a K-sequence for Mo; indeed, NOE Ni directly follows from the 
analogous property of the K-sequence {M9 hez, while Viez M = Misa 
consequence of the fact that V,ez N{® = M\ together with the observation 
that Mo = M V B C V,ez M. Finally, Arez Ni = {Al} follows from the 
fact that, according to Proposition 2.3.5, Tail (B) = {Al}. 

Let us now insert the subalgebras Nj; in the place of M; in (7.82); us- 
ing (7.47), for f € N, 8 € J(q), let us consider 

Half] Wel) = = > Pe, 


a te J(q) 


Now, it turns out that the map in (7.45) fulfils 


PNO = eals (r+) =s (are 2) = s (an + <A8®) 


= 64 hEN 
Since ae maps the subalgebra B into itself for all s € J(q), it turns out that, 


when f € Nj, all the components in (7.82) also belong to Nj. Therefore, with 
f.g E NM, it follows that 


Ni, 3 H[f] Wo(s) Helg] Wolt) = (Hel f] Hela] Wolals + €])) Wo(< s +t >). 
A et 
Ell <s+e> (Ne) 


This shows that the linear sets in (7.83) are subalgebras of M,/ and com- 
pletes the proof that the quantum dynamical triplets (Mp/q; Oa, w) are al- 
gebraic von Neumann quantum K-systems. 


Remark 7.1.13. The reason why all quantized hyperbolic automorphisms of 
the torus are algebraic K-systems for rational deformation parameters is that 
the properties of their classical counterparts are inherited by the commutative 
subsystems (the centers) contained in (M,/q,0a,w). When the deformation 
parameter is irrational, this is no longer true and indeed one can prove that 
a part from countable sets of 0 € [0,1] (Mo, @a,w) cannot be algebraic K- 
systems [44]. 
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7.1.5 Quantum Spin Chains 


As sketched in Example 7.1.1, the algebraic structure of quantum spin chains 
is as in Definition 2.2.5, the difference being that at each site of the one- 
dimensional lattice indexed by the integers k € Z, instead of diagonal matrix 
algebras, there remain assigned copies A; of a same non-commutative algebra 
A. In the following, we shall consider A = Ma(C), namely chains consisting 
of linear arrays of d-level quantum systems (or spins). 

The algebra Az associated with the infinite chain is the quasi-local C*- 
algebra which arises by taking the norm closure of the *-algebra consisting 
of operators from all local algebras Aj_¢g := Qi- Ak = Ma(C)®@49, 
supported by the lattice sites in the interval [—@, 4]. If Ao denotes the strictly 
local * algebra Uey Aje, then Az = Al, 

The local algebras Aj_¢. = Qi- Ap describe spin arrays located at 
finitely many lattice sites —£ < k < @. Their elements are linear combinations 
of tensor products @t__~Ar, Ax E€ Ag. If 0 < L< p, the local algebra Aj_¢ 4 
can be embedded into Aj—p,p] as follows; we shall denote by 

Vi 5] — 8i; lk ' qj = git olk j th at = Opa jg lk (7.84) 
the tensor products of identities at sites from 7 to j, from —co to i — 1 and 
from j + 1 to +00, respectively. Then, Aj;_¢ is embedded into Aj_, ») as 
li-p,—2-1] D Aje] Q Iie+1,p]: Analogously, Aj_¢ is embedded into Ag as 
1_¢_1) @Al_¢.4 ® lie+1. In the following, for sake of simplicity, we shall often 
identify local algebras A;_¢ 4 with their embeddings, as well as their elements 
as elements of Ap. 

The dynamics over Az is the shift automorphism Os : Az — Az 


Ool Aje) = Al—e41,¢41] 
9e (1e & (k-er) & Tea ) =14® (aadi) 8 Deyo - 


In order to complete the description of quantum spin chains as quantum 
dynamical systems we need provide Az with translation invariant states, 
that is with positive functionals w : Az +> C such that wo O, = w. Given any 
such state, its restrictions wy; ,; := w[Aj,;) to a local subalgebras Ay; j} := 
Qj; Ax are density matrices pj; j) E€ Ma(C)®9-*). Since it originates from 
the global state w, the family of pj) is automatically compatible with the 
embedding of Ajj) C Au j+1]; that is they fulfil the condition 


w(Aj®-++A; Q 1y41) = Trug (04,541) 41 ® > Aj ® 1541) 
= Tris] ((Trij+1}Pli +1) Aj @-:: Aj) 
= Trug (Pu jAi 8 Ag) 
where Try j] indicates that the trace has to be performed with respect to an 


orthonormal basis of the Hilbert space (C4)80-i+1) associated with the spins 
at sites i < k < j. In other words, 
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Tr 4541} (Pli j+) = Plij] > Vij EZ. (7.85) 
Further, translation invariance implies 
w(@,(A; 8 --- A;)) = w(l; 8 A; @--- Aj) 
= Trj (Pujli 8 As 8- Ay @ lj) 
= Trus) (Triat) 48 A) 
= w(A; Q- Aj) = Trug (p34 8 Ag) 
whence the local states satisfy 


Tra} (Pu, j+1]) = Pp+1,9-41] = Plij] > Vi jEZ. (7.86) 

Vice versa, if Az is equipped with a family of local states pj j}, i,j € Z, 
satisfying (7.85) and (7.86), then they define a translation invariant state w on 
Az such that its restrictions to local subalgebras satisfy w [Aj;,;) = Pu, [10]. 


Definition 7.1.11 (Quantum Spin Chains). 
Quantum spin chains are dynamical systems represented by algebraic 
triplets (Az,O,,w) where 


1. Az is a quasi-local algebra with a d-level system at each site; 
2. Os : Åz œ> Az is the shift-automorphism over Az; 
38.wW:Aze C is a translation invariant state over Az 


Example 7.1.14. Quantum spin chains turn out to be C* algebraic quantum 
K-systems; indeed, one argues in the same way as for classical spin chains 
(see Remark 2.3.5). The K-sequence consist of the quasi-local algebras A; := 
Ot (Ao) where Ao C Az is the quasi-local algebra which arises as the C* 
inductive limit of the local matrix algebras Aj, 4), with p < q < 0. 

If w is a factor state, (Az, O,,w) is also a von Neumann algebraic quantum 
K-system. This can be seen as follows: denote by Ap the quasi-local algebra 
generated by all matrix algebras of the form Aj, 4) with t < p < q and set 
Mi = Ta(Ai)", Me = Tol Ap)”, M, := (M) for the various commu- 
tants. Clearly, the first two conditions in the second part of Definition 7.1.10 
are satisfied. The third one is obtained as follows: since Mp1 C M4, one 
finds [220] 


(A m) = UMi = UM; Mi) = (UMi) u (U Mer) 


teZ tEZ tEZ tEZ teZ 
=MUM=(MnM’ =A 


for w has been assumed to be a factor whence the center Zu = M N M’ is 
trivial. 

This is not true in general; for instance, in the case of Example 7.1.11, 
the state (7.75) is not a factor and the von Neumann system is not clus- 
tering. Therefore, according to Corollary 7.1.3, (M,6,,w) cannot be a von 
Neummann algebraic quantum K-system. 
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Ergodic Quantum Spin Chains 


In Remark 7.1.8, we have seen that asymptotic Abelianess allows to uniquely 
decompose non-extremal invariant states into their ergodic components. 

As a concrete application, consider a quantum spin-chain (Az, Oc, w) 
whose state w is extremal invariant (see Definition 7.1.8) with respect to 
the lattice translation by one site, O,, but not with respect to lattice trans- 
lations by £ sites, O£, for some £ € N. We prove the following result [58]. 


Proposition 7.1.9. Let (Az,O,,w) be an ergodic quantum spin-chain. For 
any L E N the state w can be written as a convex decomposition 


=— j 7.87 
u= 2 bs (7.87) 


j=0 


where ne divides ¢ and the wj are shift-invariant states over the spin-chain 
which are ergodic with respect to O£. 


Proof: Consider the commutant (R£ Y := mu ( Az) N {UEY of the covari- 
n 

ance algebra (see Definition 7.1.5) RE := (rolA) U us (Z)) which is built 

by means of the C* algebra 7,,(Az) and the group of unitary GNS operators 

Um n EN, instead of U,,(Z). 

Since Az is norm-asymptotic Abelian and w is assumed not to be O£- 
ergodic, because of Corollary 7.1.1, it turns out that R£, A {A1}. Let {Qi hier 
be a decomposition of the identity by orthogonal projections in (R£ )’; then, 
the cardinality of I, nz, must fulfil ne < £. 

Indeed, let P denote any of the Q;, i € I, and set Pj := UJ P (U), 
0<j <€-1. Since P commutes with 7,,(Az), one derives 


Ty(X)P; = Uzm.,(O7[X])P (UL)? = ULP To (O [XUL = Pim.(X) , 


for all X € Az. Moreover, P; commutes with U£ 


£, whence P; € (R£) for all 
0< j <£-—1 and so does P := Vv Pj, namely the smallest one among the 
projections Q such that Q > P; for all0 <j < 4-1. 


By decomposing n € Zasn=mél+r,0<r< @—1, it follows that 
n l— n — 
Therefore, P is O,-invariant, whence P = 1 as w is ergodic with respect to 
Os-ergodicity of w. Then, (7.61) in the ergodic list yields 


1=(2|P[2) < (2P; |2) = e(2| PQ). 
j=0 
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Applying this argument to each of the Q;, one obtains 


=(2| X Q;:|2)> 


wel 


Ne 
7 : 


Since A is norm-asymptotic Abelian, from the second step in the proof of 
Proposition 7.1.5, one deduces that (R£) is Abelian. Let then Q;, 0 < j < 
ne — 1 < £, be its minimal projections (see Example 5.3.4.1) with Qo such 
that 

go = (2|Qo|2) < g = ( 2] Q; |2) 


for all j > 0. Then, introduce the set 
So = fj €Z : UZ Qo (Ul)! = Qo} . 


It follows that Sg D ¢Z; further, set ko := min{0 < j € So}. Then, ¢ « ko; 
otherwise, l = p ko + q, for some p > 0 and 0 < q < ko, so that q € So thus 
contradicting the minimality of ko. 

Furthermore, set Qo, := U3 Qo (U},)j, 0 < j < ke; Qo,; belongs to (REY. 
If it is not a minimal projector Qj;), then, Qo,; = >); Qi thus contradicting 
qo < qj when j > 0. Thus, Qo; = Qio) If Qo = Vio Qs = 2520 Q (the 
projectors are now orthogonal), it follows that, as shown before, Qo € E (RLY 
and hence Qg = 1. Consequently, because of the uniqueness of the orthogonal 
decomposition of the identity in an Abelian algebra, it turns out that ko = ne 
whence Q; = UZ Qo (Ui) and qo = qj = n7 +. 

One can now introduce the states on Az defined by 


Az 3 X > w3(X) = re (2| Qym(X) |2) = (2| Qoru (O9(X)) 12) 
= wo(@3[X]) , 
for all X € Az. It turns out that the w;’s are all Of-ergodic, otherwise it 


would be possible to further convexly decompose them (hence w) into O£- 
invariant components: 


nge—1 


= Do Anes , ws 3 D on. 
As explained in Remark 7.1.5.2, the decomposers wj; correspond to projectors 
Pji € (REY such that A (Q| Pji|2) and 
ne 
Ajiwji(X) = ne (Q | Pjit.(X) |2) 
< wj(X) = ne(2| Pjit(X)|2), VX E€ Az. 


As the projections Pj; and P; belong to the commutant 7,,(Az)’, by choosing 
X =Y'Z one gets 
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(O| TaY)! Pye m(Z)|@) < (2|T(Y) Pj m(Z)|Q) - 


Since Y and Z are arbitrary elements of Az and the vectors 7,,(X)| Q2) are 
dense in the GNS Hilbert space, it turns out that Pj; < Pj. But the P;’s are 
minimal projections, thus Pj; = P}. 


Finitely Correlated States 


An interesting class of translation-invariant states on a quantum spin-chain 
Az is constructed as follows [113]. Let (B, p,E) be an auxiliary triplet, where 


1. B is a finite dimensional algebra that we shall fix to be the algebra M)(C) 
of b x b matrices acting on C? ; 

2. pis a state on B identified by a density matrix: p(B) = Tr(p B). 

3. E: A => B is a completely positive map such that 


poEK(1, 8 B)= p(B), (7.89) 


where l4,g denote the identities of the algebras A, respectively B. 


Since they result from iteratively composing CPU maps, the following maps 


i) := Bo (ids @ z=) : A” >B, n>1, E®:=E, (7.90) 


are also CPU . Consequently, the functionals p o E™® on A®” are positive 
and normalized. They are thus states on the local algebras A®”: moreover, 
the corresponding density matrices in Mq(C)®" Q M,(C) can be obtained 
by duality: in fact, 


Trg (0 [A ® B]) = Tryes (FIJAS B) . (7.91) 


where F : S(B) + S(A® B) is the trace-preserving dual map of E which 
transforms states over B into states over A & B. Analogously, to the CPU 
maps E™ there correspond the dual maps F™ : S(B) +> S(A®@+) @ B) 
given by 


F(™ := (id yon QF) oF") , n>1 FO =F. 


7 


Consider the states wj- defined on the local subalgebras Aj_¢4 by 


wj-0q(@keAn) = Tee (PE Kef) ® Isl) 
= Tre (FO) [p] (DfA) ® Is) . (7.92) 


As a consequence of (7.88) and (7.89), they satisfy the compatibility re- 
lations (7.85), that is wi—e_1,241;/Aj-¢g = wee, and the translation- 
invariance conditions (7.86), namely w_¢ = w{|-e+1,e41)- We illustrate these 
properties by means of the simplest non trivial case and choose Aj, 9); then 
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w(A® Ip) = Tra(p AS [la © Isl) = Tre(p [A8 1a]) = w(A) 


w(1ı 8 A) = Trs (PEIA ® E[A 8 1s]]) = Trs (pE[4 8 1g]) = w(4) . 


Therefore, the family of local states wļ—ẹ, defines a global invariant state 
overt the quantum spin chain Az. 


Definition 7.1.12 (Finitely Correlated States). Given a triplet (B, p, E 
as specified before, all functionals w on Az locally defined on Aj j} by 


nN 


w(@4_-Ax) = Trg (p EO) [o}_,Anl) , 


are translation invariant states called finitely correlated (FCS). 


Remark 7.1.14. The specification finitely correlated refers to the finite di- 
mensionality of the auxiliary algebra 6. Without such a restriction, every 
translation-invariant state over Az would be given as in the previous defini- 
tion. Indeed, one could then choose B := Aji 4.0}, P := w[Ajo,+0oj and as E 
the natural embedding of any Aq; j} into Aq 4.0). 


Because of translation-invariance, w |Ay j} = w MAp ,j—i+1]; therefore, the 
local structure of w is determined by the density matrices pj1,n] corresponding 
to w [Aj nj. They are recursively obtained by means of the dual maps (7.92), 


Pon) = Tre (FY p)) . (7.93) 


In order to take a closer look at the recursive structure of FCS, we make 
use of the Kraus-Stinespring representation (5.195); concretely, 


(A@B)=S Vi A@BV;, YS V{Vj=1s (7.94) 
jEJ jeJ 
Vi:Co Cat, v:oe, (7.95) 
where J is an index set of finite cardinality. With |¥A), i = 1,,...,d and 


bP), k =1,2,...,b two ONBs in Cf, respectively C°, the action of V; can 
be represented in the following two ways, 


b 
VilvP) = Y | WAR) @ lve) (7.96) 
k=1 


d 
Vile?) = dove) @ |e) » (7.97) 
f=1 
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where the vA ik? y8 wE C4 are in general neither orthogonal nor normalised. 
From (7. 96). it follows that 


b 
= >> Zf @vP (oP I, Vi SS REN WA, @ vp |. 
k= 


i,k=1 i,k=1 


Thus, $ jes V}V; = I whence -jey 5% (v4 Spk Y Ak = pq. On the other 
hand, from (7.91) one gets 


=), a (hp lobe ) Bfp) Vins | @ lr v7 | 


i,k=1 
jedi ae 


= S5 re lv (WA | (OP wv? |, (7.98) 
JEJ l,p=1 i,q=1 


where we have conveniently chosen the eigenprojections of p as ONB in C?, 
that is p = J`? rol bP )( WP |. As condition (7.89) ae to Tr 4F[p] = p, 


the vectors vA ‘ik Must also satisfy -jey oe rel wA AALE a) = fig igs 
By recursively inserting (7.98) into (7.93), one gets the following expres- 
sion for the local density matrices pj nj, 


= > 5 re Pi” yri” | , (7.99) 


jer Lp=1 
(n) 
| Vip = 5 | WA n) | Vi ih) S | Vå i)e 
E 
A 
@ |v fa 1iin-2tn-1 ) @ oer, : (7.100) 


Remark 7.1.15. Notice that, despite the recursive structure involving more 
and more factor components, for each n-tuple J) = jyjo--+jn € I there 


m) 
are at most b? vectors W7, € (C%)®". 


(n (n) 
Example 7.1.15. The vectors vi ? need not be normalized, LZA || 4 1; 
taking this fact into account, (7.99) provides the following natural decompo- 
sition of the local restrictions of FCS states, 


b 2 
(n jm) (n) ra|| £ | s(n) 
a= >. PG) ps Phy = >> G—P—— Pe”, (7.101) 


oe DDE 
L, p=1 
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(n) (n (a 
where Py, projects onto EA y / wd, |. It follows that the support of 
(n) 
each Phin] has dimension at most b?. By defining the completely positive 
(non-unital) maps A Q B 3 A 8 1 > E;[A 8 B] := V} A® BV; € B, the 
weights p(j™), j = jja -jn € T}, can be rewritten as 


b 
dk ;(n) n , 3 
p(l )) = Sore |v? |? = Tr(p pn fa haga jn gon 8 1]) : 
f=1 


Since })j<,E; = E, from (7.88) and (7.89) it follows that the probabilities 
nm") = {plj KIS jer» define a shift invariant global state wr over the Abelian 


algebra of generated by tensor products of infinitely many card(J) x card(J) 
diagonal matrices, thus a classical spin chain (D9~, O,, wz). 


As regards the action of V; in (7.97), we proceed as follows. Given the 
vectors |W?) ie € CÌ, where j € J, i = 1,2,...,b and £ = 1,2,...,d, let 
uly € M,(C) be the matrix such that (WP |v, |b?) = (bP (YP) Then, 


d b 

y= (Ivé)g ofl dP PI (7.102) 
é=1 i=1 
d b 

VEDELS (OF lose) - (7.103) 
£=1 i=1 


Then, (7.88) implies Ig = Yje7 V} Vj = Is = Dyes pea Vjevly. Further, 
the dual map F reads 


d 
A A 
JEJ p,q=1 
It then turns out that, in terms of the vjes, the translation-invariant condi- 
tion (7.89) amounts to >) ¥<) DA vl, puje = p. Finally, using (7.93), the 
local density matrices pjı,n] exhibit the following recursive structure, 


Pitn] = 5 |Y ftn) Ne Aes | Trg Cer pojmi ) ; (7.105) 


j) eI} 
kn) a eo 


where | D4) ) = ve, )®| ve, )® | A) and vja) = Viri Vinita" Vnin- 
The above expression is ae dels suited to deal with 


Definition 7.1.13 (Purely Generated FCS). A FCS w is called purely 
generated if the defining CPU E consists of only one Kraus operator [118]: 


(A®B)=VIA@BV. 
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In terms of (7.102) and (7.103), the map E and its dual F read 


a 


[Ae Bl= So (yfl Alf) ui Bo}, Flel= >> lve) (of |@ vf ow, 


i,k=1 i,k=1 


whereby compatibility (7.88) and translation-invariance (7.89) impose 


a 


sa vl = lg, Soul pvi =p. (7.106) 
i=l i=l 


The states page := F[p] on AQB and pi, = Trg (id 4 @F oF[p]) on Ap,2) 
can be explicitly written out. Notice that, because of translation-invariance, 
P{1,2] describes any two nearest neighbor spins: 


vipul ... vpo 
pass = X vi vj | @ vi pry = i a (7.107) 
_ oe 
Rigi as Ruja 
pma = X ffo] o + |, (7108) 
į j=1 š RA à 
` Dam. inn Dn 


where Rijek := Tr(vtu! PUevg). 


Example 7.1.16 (AKLT Model). A typical instance of the recursive finitely 
correlated structure is provided by the AKLT-model [4, 5], a spin-chain con- 
sisting of spin 1 particles with nearest-neighbor interactions described by the 
Hamiltonian 

1 


53 


= 1 ul 2 l 
59k Siar + (Se Sen) +3} ; (7.109) 
k=1 


3 
where Sk = (Siz, S2k, 93k) represents the spin operator for the k-th spin 
along the chain. 

The possible values of the total spin of two nearest neighbors are 0, 1 


and 2 with corresponding orthogonal projectors PA), P®), respectively PS, 
Therefore, since 


2 a 
Sk Sepi = -2PP® — PY + PLY, (Sri Siri) = 4P + PO + PP 


and pl) + p®) + pP = 1, it follows that the interaction between sites k 


and k +1 amounts to the projection P®) onto the subspace with total spin 
2. 
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spin 1 at site k can be described by means of two spins 1/2, labeled 
by k, k, by projecting with P, Rk c C* onto the 3-dimensional subspace 


orthogonal to the singlet state LA a of the pair of spins 1/2 at k and k. Fur- 


ther, after associating the spins i i k, k+1 with the pairs k, k, respectively 
k +1, &k+1 of spins 1/2, one imposes valence bonds between the pairs, by 


requiring that the spins 1/2 at k and k +1 be in a singlet state we ne pit 


follows that the common state of the pairs k, k and k+ 1, k+1, namely of 


two neighboring spins 1 is eigenstate of Pw) with eigenvalue 0. 
Further, by appending two spins 1/2 at the opposite ends, 0 and N + 1, 
of the spin 1 chain, it thus follows that the vector state 


(NS Perr IED) © WD) @--- WA, @ WE) (7.110) 
is the unique ground state for the Hamiltonian 
N-1 9 
H=y PY 4 = (1+ 85-51) ! al ' sv41- Sw) (7.111) 
k=1 


which is obtained from 7.109 by adding two boundary interactions involving 
the boundary spin 1/2 operators sg and sy¥+. In the limit of an infinitely 
long spin-chain, the above valence-bond construction provides a unique, 
translation-invariant ground state of the AKLT-model, known as valence- 
bond solid, which exhibits short-range correlations and an energy gap. 

In the limit of an infinite spin-chain, its ground state, the valence-bond 
solid, corresponds to the triplet (B,p,E) with B = Mo, p(B) = $Tr(B) and 


z : Ms 8 M2 > AQ B Vİ(AQ B)V EM, (7.112) 


where, with |b) 2) € C? the eigenvectors of the Pauli matrix o3 relative to the 
eigenvalues 1, —1 and |a1,2,3) the eigenvectors of S, relative to the eigenvalues 
=, 0, 1, 


2 1 
V|b1) = af least) a V3 |a2,b1) , V|b2) = — glasba) — - JŽ tas, 


(7.113) 


From (7.102), with o+ := (o1 + io2)/2, it thus follows that 


2 1 2 
kea a , a=. , n= [Po . (7.114) 


One can thus check that the conditions (7.106) are satisfied and, moreover, 
that the identity matrix l2 € Mə is the only solution of the second relation 
n (7.106) in agreement with the translation invariance and purity of the 
valence bond solid. Further, from (7.107) one explicitly computes 
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0 0 0 0 0 0 
[1-os Vio, o 0 2 y2 0 0 0 
0 v2 1 0 0 0 
pagg == | Voy 1 V20_]=- ; 
6 0 Vo 1+0 0 0 0 1 v2 0 
0 0 0 v2 1 0 
0 0 0 0 0 0 
(7.115) 
while (7.108) gives the nearest neighbor states 
00 0 0 0 0 0 0 0 
01 0 -1 0 0 0 0 0 
0 0 2 0 -1 0 0 0 0 
,{9 e e 0 1.0 0 0 0 0 
pug=5g{9 9 -1 0 1 0 -1 0 0 (7.116) 
0 0 0 0 0 1 0 = 4 
0 0 0 0 =f 0 2 0 0 
0 0 0 0 0 <1 0 1 0 
00 0 0 0 0 0 0 0 


Remark 7.1.16. Finitely correlated states are a useful arena for investigat- 
ing the behavior of entanglement in quantum spin chains for these are deter- 
mined by the triplet (B, p, E) (see [38, 204]). Furthermore, they are particular 
important as ground states of certain solid state Hamiltonians whereby one is 
interested in either the relations between entanglement and long-range order 
effects [227] or in the possibility to create entanglement between distant sites 
by suitable local measurements [241]. 


Price-Powers Shifts 


The so-called Price-Powers shift [246, 247] are quantum dynamical systems 
described by an infinite-dimensional C* algebra Ag, whose building blocks 
are the identity operator l and operators e;, i = N, satisfying 


e =l. (7.117) 
Their algebraic properties are determined by a function 


called bitstream: according to its values, different e;’s commute or anticom- 
mute 


exe; = (-)"* Mere, VijEN. (7.119) 


7.1 Observables, States and Dynamics 373 


By means of the above relations, every product of finitely many e,’s can be 
reduced (up to a sign) to an operator of the form 


Wi = Ci Cig Cin 3 (7.120) 


where 2 = iii2-- -in E N* stands for any choice of finitely many indices such 
that i, < i2 <... < in: 4 will be called the support of W;. 

It is convenient for later purposes to explicitly compute the commutator 
of two operators W; and W; with supports 4 = iiiz- -in and j = jij2-++jm: 


[Wi, Wj] = Wi W3 (1 (rE allei) (7.121) 


By means of the operators W; one constructs finite-dimensional local subal- 
gebras. Indeed, again by means of (7.117) and (7.119), products of W;, with 
2’s consisting of indices from a same interval [p,q], p < q, reduce up to a sign 
to some other W; from the same interval. Thus, the algebra Aj, q] generated 
by W; with z from [p, q] is a finite-dimensional unital C* algebra that can be 
embedded into the spin algebra M2(C)®(4-?+!, This becomes apparent by 
representing the operators e; by means of tensor products of Pauli matrices: 


j-l 
ej = (29); 8 (ce); 8 lyt - (7.122) 
i=1 
Since 0,0, = —0,0,;, one can check that the relations (7.119) are indeed 
satisfied. The local C* algebras generate the *-algebra 
Ay = U Ain] , 


n>1 


and by norm closure (for instance as a subalgebra of the quantum spin chain 
Q7; M2(C)) the quasi-local algebra 


A,:=A,"". (7.123) 


As for quantum spin chains the dynamics on A, is given by the shift to the 
right of the support of the operators W;: 


Wi > Of [Wi] =: Witt = ei, 4teinttein4e, CEN. (7.124) 


Proposition 7.1.10. Let Wg := 1; the linear functional w : Ag +> C ob- 
tained by setting 

w(W;) = Òig (7.125) 
and by linearly extending it to A, defines a tracial O,-invariant state on 
Ag. This is the only tracial state on Ag if and only if the following property 


holds: for all finite supports i = iyi2--+ in € N* there exists k € N such that 
Eo g(|k — iel) is odd. 
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Proof: The positivity of w can be checked by setting W = y ciWi, ci EC 
and considering 
w(WtW) = 5 GG w(WÌ W;) - 
i,j 


The expectations w(W} W;) vanish unless wi W; = 1 which can be true 
only if each e;, in W; is matched by an e;, in W;. Thus, w(W} Wi) = 053 
whence w(W'tW) > 0. Therefore, w is positive, thus continuous (see (5.49)) 
and can be extended by continuity to the whole of Ag. That w o Os = w 
follows directly from (7.124) and (7.125). Such a state has the tracial property 
w(W; W;) = w(W; Wi). 

Suppose that a state w on A, has the tracial property and that for all 
finite supports i = iriz- -in E N* there exists k such that X5% 9(|k — iel) 
is odd, then [10] using (7.117) and (7.119), one gets 


w(W;) = Dle Wi) = w( ex, Wi ek) 
Therefore, w(W;) = 0 for all W; # 1 whence © = w. 
Viceversa, if there exists a support 4 = 7172---i, such that for all k € N 


re g(|k— i|) is even, then (7.121) implies that W; commutes with A, and 
thus with Ag, whence, setting W := 1+ W; (w(W) = 1)), it turns out that 


Ag > Wj = w(W;) = w(W W;) ; W; E Ag, 


defines another state on A, with the tracial property. 


Remark 7.1.17. The property that ensures the uniqueness of the tracial 
state w defined by (7.125) is guaranteed by non-periodic bitstreams [222]; we 
shall assume this in the following. 


Definition 7.1.14 (Price-Powers Shifts). We shall call Price-Powers shifts 
the dynamical triplets (Ay, Onw) constructed as above with a unique invari- 


ant tracial state w. 


Examples 7.1.17. [13] 


1. The von Neumann algebras Mg := Tu( Ag)” that arise from the strong 
closure of Ag in the GNS construction based on w are hyperfinite. By 
extending O and w to M, one gets von Neumann triplets (M4, O, w) 
with w still a unique tracial state. 

2. If g = 0 then (M,, Oc, w) is an algebraic version of the classical balanced 
two-valued Bernoulli shift (22, To, u). The von Neumann algebra Mo is 

1 


generated by the projections p; := = which are orthogonal for a 


same index 7 and otherwise commute. 
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3. If g # 0, because of the existence of a unique normalized trace, all Mg are 
hyperfinite factors of type IJ. Indeed, their center Z, := M,N M, =O, 
otherwise there would be a positive Z € Z, with w(Z) > 0 such that 

Wi ae, 

One) ~~ @) 


the uniqueness of w: 


gives a different tracial state on M, contradicting 


w(Z Wi W;) = w(W; Z Wi) _ w(Z Wj Wi) 
wz(Wi W;) = w(Z) = w(Z) ion w(Z) 


= wz(W; Wi) . 


4. If g(1) = 1 then A, amounts to a discrete Fermi algebra endowed with 
an infinite temperature state. Indeed, ejej + eje; = 0 if i Aj so that the 
operators f , 

Pre e2i—1 +1 ezi ee t eai 
2 oOo 2 : 
i > 1, satisfy the CAR (5.62). Moreover, the expectations 
di 
w(ajay) = = 
are those of a Fermionic KMS state at infinite temperature (see Exam- 
ple (7.1.3)). 

5. The bitstream can be chosen such that the von Neumann dynami- 
cal triplets (Ag, Oo, w) are asymptotically highly anti-commutative [222]. 
This means the following: there exists a subset S C A, such that 1) the 
set 1US is dense in A, and 2) for any S € S, € > 0 and N €E N there exist 
O< ny <ng<---<ny EN such that the anti-commutators satisfy 


jers. nis: 


for all n; A nj. In this case the tracial state w turns out to be the only 
state which is invariant under the shift O,. Indeed, choose S € S and set 


X= ee [S], then 
[xix xxi s Zor + Lorton} 
tAj 
< Sys? + A 


Further, if v is a translation-invariant state on Ag, then v(X) = v(S); 
thus, by applying (5.49), 


|v(S)| = |v(X)| < 5 (uxt X) + V(x xi) < oes: X Xt) 


<5 aN = 1) 
= N ` 2N ` 
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Because of the arbitrariness of N and e > 0, it follows that v(S) = 0 for 
all S € S and because of the assumed density of the set 1US, v coincides 
with w as defined in (7.125). 


Like in Example 7.1.12, Price-Powers shifts are weakly mixing for all 
bitstreams; indeed, because of the quasi-local structure of the algebra and 
the of the tracial property of w, one need only study the asymptotic behavior 
of 

w(WiOo[W;]) = w(Wi W542) - 


Clearly, for sufficiently large t, i N (j + t) = 0, then 


lim o(WiGo[W5)) = 0, 
unless i= j = Í. 

As regards strong-mixing, it is convenient to consider strong-asymptotic 
Abelianess first; namely, at its simplest, using (7.121), it turns out that 


w (ler, ejelt les, ej42)) = (1 — (apatite) 


Therefore, unlike for weak-asymptotic Abelianess, the possibility of strong- 
asymptotic Abelianess depend on the asymptotic behavior of the bitstream; 
for instance, highly anti-commutative Price-Powers shifts cannot be strongly 
asymptotic Abelian. 


7.2 von Neumann Entropy Rate 


As seen in the introduction to this chapter, the usual setting of quantum sta- 
tistical mechanics consists of a quasi-local algebra A which is the C* inductive 
limit of local C*-algebras Ay C B(Hy) of operators localized in finite vol- 
umes V C R?; also, A is equipped with a locally normal state, namely with a 
state whose local restriction to Ay, w|Ay is a density matrix py € B} (Hy). 
Usually, w is translation-invariant, that is pyia = py, where V + a denotes 
the volume V rigidly translated by a € R? (or by a € Z? in the case of a 
lattice system). 

Consider two disjoint volumes V; and V2 and let V := Vi U Vo; then, 
Ay = Av, ® Av, and pv, , = Try, , py, namely the states localized within 
Vı,2 are obtained as marginal states of py localized within the larger volume 
V. Each local state py has von Neumann entropy 


S(V) := S (pv) = —Tr(pv log pv) ; 


then, the subadditivity of the von Neumann entropy, that is the upper bound 
in (5.161) reads 
S(V) < S(V1) + S(V2) , (7.126) 
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where the equality holds if and only if py = py, @py,. In order to understand 
the meaning of strong subadditivity in this setting, consider two volumes V 
and U such that Vo := VOU # @ and set Vi := V \ Ve, V3 := U \ Va, 
W=V,UWUV3 =U UY. Since V123 are disjoint volumes, it follows that 
Aw = Av, ® Av, ® Avs, Av = Ay, & Av, and Ay = Av, Q Avs; further 


pv, = Truy, 98v, (Pw) , pv = Truy, (ew), pu = Truy, (Pw) - 
Then (5.162) reads 
S(UUV) + S(V2) < SU) + S(V). (7.127) 


In general, the von Neumann entropy of py diverges when V f R° (or V 1 Z?); 
on thus wonders whether the rate S(V)/|V| exists when the V Ì R, Z3, 
where |V| = i dr. Among the many ways a sequence of volumes may fill 
the whole space R? (or Z?), a convenient one [314] is to consider a family of 


parallelepipeds V(a) := fæ = (z1, £2,%3) € R? : =a; < 2; < ai}, where 
a € RÌ and then to let each a; => +00 so that V (a) > R?. 


Proposition 7.2.1 (Mean von Neumann Entropy). /314] 
If (A,w) is a quasi-local shift-dynamical system with a locally normal 
translation invariant state w, its mean von Neumann entropy is given by 


Z a SVa) p SVa) 
so) = ames TVA vin Vo eae 


Proof: [314] Because of translation invariance, in (7.128) we can consider 
parallelepipeds of the form V(a) = {x eR? :0<2;< ai}. Choose € > 0 
and a parallelepiped V (ao) in such a way that 


_ aap SV) , SV) 
e Vial a 


(*) 


By decomposing R+ 3 a; = n;aĝ +b; with n; € N and 0 < b; < aj, any other 
V(a) can be written as the union of disjoint parallelepipeds 


V(a)= |] Velao) U Ve(ao) 
O<ky<ny-1 
O<kg<ng-1 
O<k3<ng-1 


Ve (Qo) := {x ER: kai <a; < (ki + Lah} 


Va (ao) := C ER’: niab <qzi< niab + bi} 


Then, (7.126) yields 


378 7 Quantum Mechanics of Infinite Degrees of Freedom 


o< (I n) sive ) + S(Vo(ao))  (**). 


|V(a)| 


By translation, Vp(ao) can be embedded within V (ao); furthermore, since 0 < 
bi < aj, each of them is the intersection of the interval [0, aj] with an interval 
[—ci, a’) — ci], ci > 0, of the same length. It thus follows that V,(ao) can be 
written as the intersection V2 of V := V (ao) with a suitably translated V (ao) 
denoted by U. Then, from (7.127) and translation-invariance one derives the 
upper bound 


S(Vo(ao)) = S(V2) < S(U U V) + S(V2) < S(U) + S(V) = 2S(V(ao)) . 


Finally, dividing («*) by V(a) and going to the limit, similarly as in the proof 
of the existence of the Shannon entropy rate in (3.2), using (*) one gets 


S(V(a)) _ S(V (ao) S(V(a)) 
eis Voj < ACAI < s(w ae e +e 


|V(a)| 
whence the result follows from the arbitrariness of £ > 0. 


Examples 7.2.1. 


1. For quantum spin chains (Az,0,,w), the mean entropy is given by 
. 1 el 
s(w) = lim —S (pum) = inf —S (Pin) > (7.129) 


where pj ,n) is the density matrix corresponding to the restriction w [Aj ,n] 
of the translation invariant state w to the local subalgebra Aj, n]: 

2. Because of their structure (see Remark 7.1.15), purely generated FCS 
w have s(w) = 0. Indeed, the support of local states pj ,n] is at most 
b?-dimensional where M;,(C) = B is the auxiliary algebra in the triplet 
(B, p,E); then S (pp nj) < 2 log, b. 

3. Consider the Bact (7.22) and Fermionic (7.20) quasi-free states w4 
and assume the action of the operator A on L3,. (R°) to be given by 


(r144) = f de Kalra W0(e) , 


where the kernel K4 has Fourier transform 


K(k) := ar f. da e*® K4 (æ) 


such that 0 < K4(k) < 1 for Fermions, 0 < K4(k) < M < + for 
Bosons. These quasi-free states are translation-invariant and their mean 
entropies can be explicitly calculated [233, 110], 
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dk (nRalk)) +1- Ralk))) (Fermions) 


nN 


n(Ra(ke)) — n + Ralk))) (Bosons) . 


a 
Eo 
os. 


Remarks 7.2.1. 


1. The entropy density of quantum spin-chains scales as the power of the 
shift-automorphism, that is the entropy production per length £ time-step 
is £ times the entropy production per unit time-step: 


1 
si(w) := lim =S (p8) = ls(w), (7.130) 
n= n 
where p\") = w MMo,ne—1. Indeed, since the limit in (7.129) exists, it can 


be computed as 


, 1 1 

s(w) = „im, mS (Pung) = 7 Se(w) . 

2. From (5.156), it follows that the entropy density is affine over all convex 
decompositions of O,-invariant states w of quantum spin-chains into O,- 
invariant components wj. Namely, if w = oz, Ajwj, with 0 < A; < 1, 
2; Àj = 1, then 


slw) = 5 Aj 8(w;), WEEN. (7.131) 


3. In the case of a decomposition of a translation-invariant state w of a 
quantum spin-chain into 6£-invariant components wj, the previous two 
points give 


$e(w) = 5 Ajselwj) = DD Ajslwj), VEEN. (7.132) 


Let us consider the decomposition of a O,-invariant state w over a quan- 
tum spin-chain which is not O£-ergodic into ne O4-ergodic states (see Propo- 
sition 7.1.9). 


Lemma 7.2.1. Given the decomposition w = + D using the nota- 


ne 
tion of Remark 7.2.1, it turns out that 


1. all states w; have the same entropy density with respect to o£: se(wj) = 
se(w), OS j < ne— 1. 
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= 
subsets of states 


2. Set g =S (o?), sO := 7S (eo) and fiz n > 0; it turns out that the 


Atn i= fw; : a” > s(w) + n} (7.133) 
has asymptotically zero density. Namely, if #(Ae,) denotes its cardinal- 


ity, then 
jim ea) _ 9 | (7.134) 


ng —> oo rhe 


Proof: 
Part 1 Because of (7.132), se(w;) = se(wo), for 1 < j < ng — 1. This fact 


? 


follows from subadditivity (5.161) and the fact that 


p = wo 0 (S |Ajone—1] = wo [Al-j,ne—j-a] : 
———n 


„=pl-37t-i-1] 
= 


Indeed, split the intervals |- j, nl — j — 1], 0 < j < ne — 1 into disjoint 
pieces (notice that, according to Proposition 7.1.9, ng < £), 


[—j,né— j — 1] = [-7, 4 — 1] U [E nl — L- 1JU [ng — L, nl- j 1]; 
a ae, A er” 
Ih I2 I3 
(n£) 


then, apply (5.161) to the density matrices p}, respectively py 3 @ pa, 
and use translation-invariance together with the bound (5.155). It then 
follows 

9 (of) <s (ac) +439 ‘Cage <S (a= + 2 logs d. 
Vice versa, if instead of subdividing the interval |- j, nl— j— 1] of interest, 
we include it as a disjoint piece in a larger one 


a E a 
Iı Ig I3 


then, subadditivity and boundedness give 
g Ca >s aes) _g (a) >s Tay — 2log d . 


Dividing by n and taking the limit n — oo yield the result. 


A 
Part 2 If there were no such that lim sup #(Aeno) =a > 0, then there 
ne—oo Ne 
#(Ag; mo) 


would be a subsequence £; such that lim = a. Then, since 


j—oœœ Ne; 


ne,—1 
pi) = 5 p , subadditivity implies 
k=0 
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2 #(Ag; no) (s(w) +70) F HAL ng) a gf) à 


The previous point, (7.130) and (7.129) obtain 


1 ml; . 
Lj s(w) = se (wj) = inf zS (0; “n <2; am whence 
Ae, AG, 
g's) > #( tm) (6(.3) #( bno) 5) : 


Ne; 


When Ne, > œ, a contradiction arises: 


s(w) > (s(w) + n)a + s(w)(1— a) > sw) . 


7.3 Quantum Spin Chains as Quantum Sources 


Quantum spin chains (Az, 0,,w), with A = Ma(C), provide useful algebraic 
descriptions of quantum sources whose signals consist of quantum states act- 
ing on Hilbert spaces of increasing dimension. The local states p( obtained 
as restrictions of w to the local subalgebras A™ := Ajin} describe ensembles 
of quantum strings of length n emitted by these sources. 

Quantum sources are one of the two ends of quantum transmission chan- 
nels; like their classical counterparts, these consist of a source, a sender who 
encodes, a channel which transmits and a receiver which decodes. Channel in- 
puts and outputs are generic quantum states and the encoding and decoding 
procedures, as well as the channel action are quantum operations described 
by trace-preserving CP maps on the state-space. 

In analogy with Figure 2.2, a quantum transmission scheme can be pic- 
torially represented as in Figure 7.1. 


1. At each stroke of time, a source A emits quantum states, represented by 
density matrices p; € B} (H), i = 1,2,...,a, H = C*, with weights p(i). 
The statistical description of a single use of the source is given by means 
of the density matrix p = )°"_, p(i) pi. 

2. As a result of n uses of the source, the sender would collect generic 
density matrices pi), € BY (H®”), i™ = iiiz- -in € Aa), with weights 


Er 


p™(i™). Consequently, the statistics of n uses of the source is described 
by the density matrix 


382 


Ber 


i = 


. The encoding is a trace-preserving CP map E™ : B} (H®”") — B} (K; 
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2 ~ (0) 


<—: decoder: 
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S 
S 
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3 
Q 
> 
D 
3 
3 
2 


Fig. 7.1. Quantum Transmission Channel 


prim F pap mri (7.135) 
imen 


which embodies purely classical correlations (the weights) and quantum 


correlations due to the states EN 


(m) 


n 
such that €( [0] = a are density matrices that can all be consid- 


ered as acting on a same (finite dimensional) Hilbert space K™), 
(n) 


. The code-states oin) go through the (lossless) channel that transform 


them as a trace-preserving CP map F™ : BY (K°”) > B (KC? 


in out 


) such that 
FO [o s = ce the latter being a, possibly not-normalized, positive 


matrix acting on a (finite dimensional) Hilbert space Km). 
~(n) 


. The channel output o. ae finally undergoes a decompressing procedure 


corresponding to the action of a CP map D™) : B,(K)) > B+ (H®") 
such that D™ [o (n) ol = ae 


. The efficiency of the encoding-decoding procedures with respect to the 


channel action F is measured by how faithfully the decompressed states 


TA reproduce the input states p: 


The simplest instance of quantum source is the generalization of a classical 
noulli process: at each use of the source, vector states | y;) € H := C4, 
1,2,...,a@, (not necessarily orthogonal) are independently emitted with 


weights p(i). The quantum statistics of n uses of the source is thus described 
by the density matrix 


j=l j=l 


im EQ” 
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where p = Ypa PC) Yi )( Ye | and p™ (&™) = TI- lis). 


Remarks 7.3.1. 


1. Quantum spin chains as they appear in quantum statistical mechanics 
provide fairly general models of quantum sources. Their local states over 
n successive chain sites correspond to density matrices p°”) that describe 
a variety of possible quantum strings of length n consisting of separable 
and entangled states that can in turn be pure and mixed. 

2. Like classical strings, quantum strings emitted from quantum sources of 
Bernoulli type can be chained together by tensorizing them; this is not 
anymore so obvious for generic quantum strings [60]. 

3. Two classical strings can always be told apart, for instance by a non- 
zero value of the Hamming distance that counts by how many symbols 
they differ. Instead, there are uncountably many quantum strings that 
can be arbitrarily close to one another, for instance with respect to the 
trace-distance (6.66), and which cannot then be perfectly distinguished. 


7.3.1 Quantum Compression Theorems 


In analogy with classical coding, the idea how to compress quantum infor- 
mation in absence of noise is to consider quantum strings acting on Hilbert 
spaces H®” with n large and to map them into quantum strings acting on 
Hilbert spaces H”) of smaller dimension in a way that allows for faithful 
decompression. 

Concretely, the procedure consists in a coding operation corresponding to 
a trace-preserving CP compression map E£™ : BY (HI®”") +> Bi (H™) and a 
decoding operation described by a trace-preserving CP decompression map 
D™ : BY(H™) — By (H®") that tries to retrieve the source signals. If the 
task is to compress the information contained in n uses of a quantum source, 
then each of the quantum strings in (7.136)) is subjected to the chain of maps 


Pye? OO = EM [pO] 3 A), = D™ [oO] . (7.137) 
Any sequence {€), D(™},, will be referred to as a compression scheme and 
denoted by (E€, D). 

In the following, we shall first focus on quantum Bernoulli sources emitting 
qubits, that is we shall consider Hilbert spaces H8” = C?” and local algebras 
A(™ = Man (C). For them, the compression rate of a scheme (C, D) is defined 
as follows. 


Definition 7.3.1 (Compression Rate). The compression rate of (£, D) for 
a qubit quantum source (Az,w) is given by 


1 
R(E) := limsup — log, dim(H™) . 


n—+oo n 
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where H™ is the minimal support subspace of all quantum code-words Gon 


According to the previous definition, for large n, 2” P(E) estimates the di- 
mension of the subspace supporting the encoded signals, with R(E) roughly 
being the used number of qubits per encoded qubit . eel one looks for 
compression schemes (E, D) such that R(E) < 1 with D™® o €™ asymptoti- 
cally approximating the identity map in a suitable ie 


Compression of qubit Bernoulli Sources 


In the case of a Bernoulli quantum source, Shannon’s noiseless coding the- 
orem 3.2.2 has a natural quantum extension whereby the von Neumann en- 
tropy plays the role of the Shannon entropy as optimal compression rate. 

A convenient fidelity is the ensemble fidelity introduced in Definition 6.3.6; 
using (6.71) it reads: 


Feo ({p eae p™ agt) = ` P Taa B) - (7.138) 
imenf™ 


It is positive, bounded by 1 and equal to 1 if and only if p= = ae Useful 
upper and lower bounds to Fy, are obtained as follows. 
If p= eer r;|r; )(r;| € S(C?) is the spectral decomposition of the 


state describing a single use of the source, the eigenvalues oe of p®” are 


of the form es =: [irz Let H™ C H8” be the smallest subspace, of 


dimension d(n), supporting all code-words 2 and let P™ : H8”? = Hi”) 
denote the corresponding orthogonal projection. Then, 


Fa( {pime p PO CEM) < D pi (pF) 
Mens” 


d(n) 


< X Cee (7.139) 


Indeed, the first inequality is implied by the fact that n < Pr”, while the 
second one is the Ky Fan inequality (5.158), where e;(p8”), j = 1, 2,- -,d(n), 
are the first d(n) largest eigenvalues of p8” 

Vice versa, given  : H8” ++ H™), consider the trace-preserving CP 
map €™) : BY (H®") > Bf (H™) defined by 


E= rM prM+ XO [0X Slol], (7.140) 
|B, ) LK) 


— 


loyor (a-pe) 
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where |0) € H™ is a suitable reference state. As a decompression map D\™, 
choose the identity map on H) which embeds it into H®”. Then, 


A, =r pf, P™ + Jooh- P) 2) , 


a i 


. n 
whence, since p 


2 
Fa( {piep DE) J oio Thor) ) 


) is a pure state, 


Mea” 
2 
> = Be (Troe r))) 2 5 Ain) (2 Teha p) 2 1) 
merh iln) 
L a — 1. (7.141) 


Exactly as the Shannon entropy in the classical case, a theorem of Schu- 
macher [267, 159] shows that, for quantum sources of Bernoulli type, the von 
Neumann entropy S (p) is the optimal compression rate. Namely, this rate 
can be achieved by suitable compression and decompression schemes with 
high-fidelity retrieval of increasingly long qubit strings; on the other hand, 
compression and decompression schemes with rates exceeding S (p) perform 
poorly with long qubit strings. 


Theorem 7.3.1 (Schumacher Theorem). 

Let (Az,p®~) be a qubit Bernoulli source with entropy density S(p). If 
R > S(p) there exists a compression scheme (E,D) with rate R(E) = R and 
ensemble fidelity Fa, tending to 1. On the contrary, if R < S(p), then for 
every compression scheme such that R(E) = R the ensemble fidelity tends to 
0 in the limit n — oo. 


( 

j 

) = Pohea on the strings j™ E ay with Shannon entropy 
2 


Proof: The eigenvalues r By of p®" provide a probability distribution 


(n 
T 
n S(p). According to Proposition 3.2.2, for any 6 > 0, € > 0 and n large 
enough, there exists a subset Al” of probability 


Prob( A”) = Y ri) = ain i) S15 
jens 
and cardinality d(n) satisfying 
(t= 8)27(S(P)-6) < d(n) < gnr(S(p)+e) 


Let I” project onto the subspace linearly spanned by the eigenvectors 
ee ) = |rj,) 8 | rj. )-+: ® | r;, ) corresponding to the eigenvalues rs 
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je AM”), Such I can be used to construct the compression map (7.140); 
hence, from (7.141), Fav > 1— 206. Also, the bounds on d(n) ensures that any 
rate R < S(p) is achievable. 

Vice versa, if d(n) < 2"), then, according to Theorem 3.2.2, given 
the probability distribution m™ = Chena ERM» 
d(n) strings has vanishingly small probability, ` 


any subset Ban) with 


Prob(Bain)) = 5 ne = ma Pan) <e 
JEBan) 


for n large enough, where Ign) projects onto the subset spanned by the 
eigenvectors relative to the eigenvalues indexed by j™ e Ban). It then 
follows that also the sum of the first d(n) largest eigenvalues of p®” must be 
smaller than € and so also Fay < £ because of (7.139). O 


Example 7.3.1. [159] In a single use, a Bernoulli qubit source emits the 
non-orthogonal states 


lyo) = V1 —e|0) + vell) , |41) = v1- e|0) — vel1) 


where 0 < £ < 1/2, with probability 1/2 each; the corresponding statistical 
ensemble is described by 


p= sl¥o)(vol+ slvs)(vr| = 1 —e)10){0] +e]1){1 


Suppose that, given the 3-qubit strings | Viisi) = | Vi, ) 8 | Vin) 8 | Vis ), 
only two qubits can be transmitted; how can the transmission of quantum 
information be optimized? 

Since (0| 40) = (0|d1) = VIZE, (1| Yo) = —(1| vr) =e and € < 1/2, 
the sender may encode and decode each 3-qubit string by tracing over the 
third qubit and appending the high probability state |0)(0| in its place: 


ala 3 3 
| WPirizis ) Wiriris | => Ds = Di 2 E ji Wirinis ) Virizis |] 
= | Pirin )(Viriz | & |0)(0| : 
The average fidelity then results 
~(3 
FO T 5 ( Pirizis es [Dis ivis ) =l-e. 
21 ,22,73 


A better strategy arises from considering the components of a 3-qubit string 
along the eigenvectors of p83: 


|( 000 | i, izis | = (1 => ere [, 110 | Viriziz )| =éevl—eée 
|(001 | Wi, i285 | = (1 — E) VE |( 101 | His inés )| = € L=<€ 
|(010| Piisi | = (1 —e) VE i [C011 | Pirizis | = EVv1-— € l 
K 100 | Wirinis )| = (1 = E)VE K 111 | Virizis | = 63/? 
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Since € < 1/2, the eigenvectors | 000), | 001), |010) and | 100) provide higher 
probabilities than the second four eigenvectors; let P project onto the linear 
span. Observe that the unitary permutation 


|000) ++ |000) |111} = |001) 
y. J 1001) = |010) |110) + |011) 
' į [010) = |100) |101) |101) ’ 
|100) > |110) |011) = |111) 


is such that UPUt = 112 ®|0)(0|, where 112 = $; jo |4j) (ij | is the iden- 
tity matrix of the first two qubits . Therefore, one can construct a compression 
map as follows; first, introduce the trace-preserving CP maps 


[ol = PpP + |000) (000 | Tr ( (1 — P) p) 


a 
z 


Ut = Tho [0)(0|(Ut pV) the @ |0){0| 


+| 000 )( 000 | Tr(( = P) p) . 


Then, define EP : BY (C3) — BI (C?) as EP [p] = EP [U E[p] U"] and De : 
BY (C2) 4 BY (C3) as DY? [a] = Uto @|0)(0|U. It follows that 


DË o€ Ip] = PpP + |000 ){ 000 | Tr ( (1 — P) p) . 


Thus, with ae) = pP 2 EPI Virizis M Diiis I 


Piiizis 


(Wirinig |) ig irinis ) = | Pirints |P Wiizis )? 
= 1 — 9e? be — Oe" + De”. 


As the right end side of the last inequality is the same for all 3-qubit strings 
considered, this is also the value of the fidelity FÊ). As shown in the figure 
below, the latter turns out to be larger than FY) for0<e<1 /2. 


Compression of Ergodic Quantum Sources 


As showed in Section 3.2.1, Proposition 3.2.2, Theorem 3.2.1 and Theo- 
rem 3.2.2 establish the role the entropy rate as the optimal compression 
rate of classical ergodic sources. Theorem 7.3.1 assigns the same role to the 
von Neumann entropy in the case of quantum sources of Bernoulli type. 

For this particular family of quantum chains, the von Neumann entropy 
coincides with their entropy density as defined in Section 7.2; it is thus ex- 
pected that a kind of general Quantum Shannon-Mc Millan-Breiman Theorem 
should hold for generic ergodic quantum sources. The relevance for quantum 
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Fig. 7.2. PF — FO) against 0< €< 1. 


information of quantum spin chains endowed with states with a more general 
structure than a tensor product has been emphasized in Section 7.3, where 
analogies and differences between classical and quantum contexts have also 
been outlined. 

In particular, in Remark 7.3.1.2 it was pointed out that, unlike for classi- 
cal bit strings, one cannot profit from any natural ” chaining together” qubit 
-strings. Though ideas how to circumvent such a problem have been put for- 
ward [60], this fact represents an obstruction to a full quantum generalization 
of the classical Breiman theorem. The latter is an almost everywhere state- 
ment regarding single sequences, while the Shannon-Mc Millan formulation 
is concerned with statistical ensembles; of this theorem there exist a number 
of extensions to particular non-commutative settings [223, 240, 169, 94] and 
a full quantum extension [58]. This general result has then been used [59] to 
devise compression protocols for ergodic sources consisting of encoding and 
decoding procedures similarly to what outlined in the previous section. 


Theorem 7.3.2. Let (Az,O,,w), with A = Ma(C) as site-algebras, be an 
ergodic quantum spin-chain with mean entropy s(w). Then, for all 6 > 0 
there is Ns E€ N such that for all n > Ns there exists an orthogonal projection 
pn(d) E An such that 


1. w(pn(5)) = Trn(o™ pa(d)) > 1-6, 
2. for all minimal projections 0 Æ pn E€ An dominated by pr(d) (p < pn(d)) 


(L= 0a e) < w(pn(ô)) < g—n(s(w)—9) , 


3. gn(s(w)—ô) < Trn(pn(5)) < gn(s(w)+s) 


That the above results extend Proposition 3.2.2 is apparent: classical high 
probability subsets are replaced by orthogonal projections pn(ô) whose sta- 
tistical weight with respect to the translation-invariant state w is nearly 1; 
further, the typical subsets correspond to orthogonal projections whose nor- 
malization (dimension of the associated Hilbert subspaces), Tr(pn(d)) goes 
as 2”8(“) for large n. 
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Like in the case of a Bernoulli quantum source, the proof of Theorem 7.3.2 
£ 
hinges upon considering the discrete probability distributions r®) = { 7 yi 
consisting of the eigenvalues of the density matrices p®) describing the restric- 
tions w to the local subalgebra Ap = M,(C)®* with spectral decomposition 
dé 
p) = 5 7) | r y(n |. The Shannon entropy H(z) equals the von Neu- 


i=l 
mann entropy S (0); further, from the definition of entropy density s(w), 
given 7 > 0, for infinitely many @ one has 


slal r 1 1 
slw) = inf z5 (o ) < 75 (0) = pi) < s(w) +7. (7.142) 


For Bernoulli quantum sources, the products of eigenvalues of single site 
density matrices provide a natural Bernoulli stochastic process, whose en- 
tropy density s(w) is exactly the von Neumann entropy of p. Such a struc- 
ture is missing in the case of generic ergodic quantum source. However, 
from (7.142), one observes that choosing £ large enough, S (p?) ~ Ls(w). 
£) 


Moreover, the eigenvectors |r; )(r | are minimal projections generating 
a maximally Abelian subalgebra © C Ay and the eigenvalues 7 define 
a probability 7 over the symbols i € I := {1,2,...d°}. By tensorizing 
copies of the Abelian subalgebra D, one can embed the Abelian subalgebras 
Dn := D8” into the local algebras Ane and C*-induction yields a quasi-local 
Abelian algebra D° embedded into the quantum spin-chain Az. 

The Abelian algebra D% is clearly associated to a triplet, or symbolic 


model (ñ, Lor pO) (see Definition 2.2.5 and the preceding discussion) where 
Ñr is the space of sequences of symbols from J, Tọ is the shift along these 


sequences and pi“ is the measure on Nr that arises from 7. Further, 
from (7.130), the (classical) entropy rate is 


ni) = ints (0) = a)(w) = Ls(w) . (7.143) 


n=œ n 


As the automorphism over D% corresponding to the shift Tọ on Qi is not 
Os, but its th power O£, the Abelian spin-chain associated to the symbolic 


model of above is ( OF w ar) (see Defintion 2.2.5 and the preceding 
discussion). The state w is @,-ergodic, but not in general @4-ergodic. If it 
were O£-ergodic, (oP, Of w aF) would amount to an ergodic process and 


we could use the classical techniques as in Proposition 3.2.2 with the mean 
entropy h(w DZ) = £s(w) in the place of the Shannon entropy as follows 
from the classical Shannon-Mc Millan-Breiman result in Theorem 3.2.1. 
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Remarks 7.3.2. 
1. The ergodicity of the embedded Abelian spin-chain ( © Ol, w ar) 


would follow from that of the quantum spin-chain (Az:9%,w), since 


otherwise the resulting decomposition of w |D? into ergodic components 
would provide a decomposition of w as well. 

2. In [240], the quantum Shannon-Mc Millan theorem was proved under the 
assumption of 0 ergodicity of the spin-chain state w: such a property is 
known as complete ergodicity. This restriction has been removed in [58]. 


The possible lack of O4-ergodicity can be overcome by means of Propo- 
sition 7.1.9 and of Lemma 7.2.1. Indeed, the argument of above can be de- 
veloped for the @4-ergodic components w; indexed by j € Abn for which 


se(w;) = €s(w) and ee =45 (oP) < s(w) +n, for some fixed 7 > 0. For 


() 


each of these wj, one considers the local states p} over A, the probability 


(6) corresponding to their spectra, the Abelian subalgebras D/ 


distributions T) 
generated by thet spectral projections pe , i € Ij, and the associated ergodic 
Abelian spin-chains (De, 68, wj ar): Because of the bound (3.7) in Re- 
mark 3.1.1.1 and of the choice of indices j € AS 
rates 


Z these chains have entropy 


hj < H(n 9) = S (an) < L(slw) +n). (7.144) 


After identifying strings i) of symbols from I. j with minimal projections 
Pim E DÍ C Ane, so that mi = Tro ne-yo pim), one can choose 
positive £, ô and select subsets of minimal projections, 


dg — {pio = Dİ ; g—n(hj +6) Z Trone—1] (0°) pim) < g—n(hj—9) 
(7.145) 
such that, by using Proposition 3.2.2, Theorem 3.2.1 and (7.144), for n large 
enough 


PCM) = Trone-ylpim) < POHD < PED (7,146) 


n n (2 

PeP) YP Trone-ypy?) 21-5 (7.147) 

Pien ECS” 

where pis =; i 

pu ECL” Pi: 
In order to use these arguments and conclude the proof of the quan- 
tum Shannon-Mc Millan theorem, some further results are needed. The first 
one deals with discrete subsets equipped with (not necessarily compatible) 


7.3 Quantum Spin Chains as Quantum Sources 391 


probability distributions and with the asymptotic behavior of their minimal 
cardinality. 


Lemma 7.3.1. Let D > 0 and {(taimm)} on be a countable family of finite 
sets In of cardinality #(In) with R Aa P distributions Tn = 


1 
{Pn (i) icr,- Suppose — logs #(In) < D for all n and define 
n 
sml Ta) = min{ log, #(2) : QC In, mR) >21- e} : (7.148) 
L In, n ti 
if {( T. Ia satisfies 


1 1 
lim —H(t)=h<oo (1) and limsup—azn(tm) <h (2), 
n 


noo n aoe 


for alle € (0,1), then 


1 
lim —Qen(tm) <h, Ve € (0,1). (7.149) 


noo n 


Proof: Let ô > 0 be arbitrarily chosen and distinguish the following dis- 
joint subsets of In: 


fo) = {i EIn : T(t) > Pa l 
n (ô) = {i ed, ¢ g—n(h+6 < Tn (4) < gaa , 
I3 (8) = {i Eln: T(t) < ey 


= b > 0; then, there exists n such that 


Suppose 1 > lim supp t™(I?(6)) 
) < 1—b. Choose 0 < € < b; if m,() > 1—e, 


Tn (I3(8) > band mq (14(5)UI2(6) 
Le Sm(Q) < 1-b+m(2N1(3)) implies 
b-E<T (2nR0)) < #(20 (9) g-m(ht3 and 


log. #(2 N 73(6)) > log,(b—«) + n(h + ô) , 


1 
whence lim =Qen(nn) > h+ ô contradicting the second condition in the 
noo n 
statement of the lemma. Thus, Jim mA (6) = 0. 


It also follows that I3(6) sana ynol contribute to the Shannon 
entropy H (rn); indeed, applying inequality (2.85) to the (non-normalized) 
OE: E, . . Tn (13(6) 
d t b t n i 5 d n := LAA 
istributions {pn (i) }er3 (5) an {a (i) FT3(6) 


that > iera (s) Pali) = Viers(5) In(2), yields 


\ , which are such 
i€ 13 (6) 
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1 ; . 1 ; . 
hs =-— DO mlog) <- Y pali) logs dn(i) 
i€I3 (6) i€I3 (6) 
1 ; 
=- S> pli) (logs 73O) — logs #(23(8))) 
i€ 13 (5) 


IA 


-r (E3(6)) ~ logs tn (E(8) + D mnl) 


The right hand side of the last inequality goes to 0 with n — oo due to 
Tn (13(5) + 0 with n — œ and because log, #(I,) < nD by assumption. 
Further, this very same fact implies limp... t(I}(6) = 0 for all 6 > 0, 
otherwise 


1 1 . . 1 . i 
nHn) = `. Pn(i) logs Pn (i) — a 5 Pn(i) logs Pn(i) — hp 
i€ 12 (ô) ic I2(6) 
< Tn (I,(6)) (h=) + Tn (I;,(6)) + hi, 
< h + 6(n(IA(6)) — mn (I2(8))) + BS 
would contradict the first condition of the lemma for sufficiently small 6. 
Consequently, lim 7,,(J?(5)) = 1 for sufficiently small 6. Thus, choosing 


n so that t,(Q) > 1—e and 7,(I?(5)) > 1-1, it follows that mn (anro) > 
1 — € — n, whence 


#(2 N 12(8)) >(1—e—n)2”®-0) implies 


1 1 
=O n(n) > —logo(l1—e—n) +h- ô. 
n n 


Since ô can be chosen arbitrarily small, the result follows. 


Returning to the probability distribution 7 associated with the ordered 
spectrum of p, fix e € (0,1) and set 


k 
Negi min{ 1 sked: Y rP >1- e} , (7.150) 
4=1 


so that ae o(7) = logy Nee. If 


1 
lim sup gN < s(w) , (7.151) 
£—00 
then, together with (7.142), this allows using Lemma 7.3.1. As follows: let Te 
be the set of indices labeling the eigenprojections |r Now |. Then, choose 
0 < 6’ <6 and consider the subset I?(6’) as constructed in the lemma and 
set 
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li £ 
P= S PNP. 
i€1?(6’) 
It turns out that, for @ sufficiently large, 
Tr(o RA) = SD P =e OR") 21-6. 
i€ 12 (6') 


Further, every minimal projection p < P:(d) dominated by P¿(ô) projects 
onto a vector of (C?)®%, 


psie Ws So vel), J efs 


i€ I? (ô) i€I? (6) 


whence, by the definition of the subset 17(6’) 


gQ—4(s(w)+9) < g—£(s(w)+5') < Tr(p p) <2 L(s(w)— 8’) <2 L(s(w)—ô) ; 
Finally, from Tr(p P,(5)) > 1—6 and 


EUU < Tro Pe) = SO of < HAE) 2 OO” 
i€ I? (6’) 


it follows that (1 — 8) 2%°)-®) < Tr(P,(5)) = #(7?(5)) < 2°°@)+9), thus 
concluding the proof of Theorem 7.3.2. 

Of course, it remains to be showed that (7.151) really holds true. The proof 
of this fact hinges upon a per se interesting result concerning the minimal 
dimension of the so-called high probability subspaces. Practically speaking, 
these are the relevant subspaces: as already seen in the case of Bernoulli 
quantum sources and as it will be showed at the end of this section, they 
allow for quantum compression with reliable retrieval. 


Definition 7.3.2 (Typical Subspaces). 


1. Given a quantum spin chain (Az,w), projectors pn E An such that 
w(pn) = Tr(p™ pn) > 1—e will be termed w-typical projectors and w- 
typical subspaces the subspaces of (C*)®” onto which they project. 

2. For any £ > 0, let 


Ben (w) = min {logy Tr(q) : An dq= g=¢, Tr(p™ q)>1- e} 
(7.152) 


The following result relates the spectrum of local states to the dimension 
of high probability subspaces [58, 140, 141]. 


Lemma 7.3.2. Be n(w) equals Nen in (7.150). 
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Proof: By definition, 


Ne,n Ne,n 
TOD [eo |) = TO 2) p B1-e, 
i=l =! 


whence Gen(w) < Nen. If the inequality is strict, then there exists a projec- 
tion q € An such that m := Tr(q) < Nem and Tr(p™ q) > 1—e. Then, using 
Ky Fan inequality (5.158), a contradiction emerges: 


m 


le < T(= lalla) < Sor <1-e, 


t=1 i=1 


where |q; ) (qi | are minimal projections such that q = 7", | qi) (qi l. 


For showing (7.151), the key result is 
Lemma 7.3.3. For an ergodic quantum spin-chain (Az, Ow) 


1 
lim sup Gen) <s(w), Vee (0,1). 


n—> oo 


Before proving it, observe that the previous two lemmas imply a quantum 
counterpart to the AEP (see Theorem 3.2.2). 


Proposition 7.3.1 (Quantum AEP (QAEP)). 
Let (Az,Oc,w) be an ergodic quantum source with entropy rate s(w). 
Then, for every0 <e <1, 


lim T =s(w) . (7.153) 


n=œ n 


Remark 7.3.3. Operatively, the previous proposition states that any se- 
quence of typical projections must project onto subspaces whose dimension 
goes as 2”*() asymptotically. 


Proof of Proposition 7.3.3 Lemma 7.2.1 ensures that for any £ > 0 and 
fixed 7 > 0 there exists L € N such that £ > L implies 


paH) e Hin j Haj E, 
ne 2 ne ne 2 


Consider the O£-ergodic components wj of w, the sets Ce in (7.146) and 


the smallest projector q”! := Vje Ag p” larger than all p”, FEAL, 
n £ 
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m=nl+r,0 <r < £, set qm := qe Line,ne+r—1] and, by means of (7.147), 
estimate 


M 1 ne-1 os 
Trjom—1) (pí dm) = ae Tr ae, am) 
amr 
ne—1 
a ` 7 (nb) (nt 
= iz T[o,ne—1] (2; q ) 
j=0 
1 u (n£), (n) #(AG ) a 
Sone T NE) DY) > mej Sl : 
on > Y[o,ne—1(P; Pj ee i ( 5) 2 7 


Then, definition (7.152) and (7.146) imply 


Beym(W) < logs Tro,m—1] (dm) = logy Trio ne—1(q) + rlogsd 


< log ( 5 Trio,ne-11 (p) + rlogyd 
jEAG,, 


< log, #(Az,) + n(E(s(w) +) +6) + rlogsd , 


whence the result follows from the arbitrariness of 7 and 6 and from 


1 : 1 : 
lim sup z Gem(w) < lim sup — log, #(AZ,,) + sw) + 04 7 


m— co m— oo 


Universal Quantum Compression 


Based on the classical construction of universal codes [325, 168], of which a 
particular instance has been given in Section 3.2.1, one may disengage the 
compression from its explicit dependence on the quantum source statistics 
by resorting to Universal Quantum Compression Schemes [163]. 

In the following, the construction in [163] will be slightly modified. We 
shall consider a quantum spin chain Az with an ergodic translation-invariant 
state w, an increasing sequence of local subalgebras An as defined in Sec- 
tion 7.3 and local states w fA, described by density matrices po”), 

The idea is to construct the analogous of the typical subsets A as in the 
proof of Proposition 3.2.3, with cardinality growing as 2”? and probability 
a) (AM) tending to 1 with n for all sources A (Bernoulli in that case) 
with entropy (rate) H(A) < R. As always when passing from classical to 
quantum sources typical subsets will be replaced by typical subspaces and by 
the associated orthogonal projectors. 
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Theorem 7.3.3 (Universal Typical Subspaces). 
Let s > 0 and € > 0. There exists a sequence of projectors Qs) E An, 
n EN, such that for n large enough 


Tr(a") < gn(ste) (7.154) 


and for every ergodic quantum state w on Az with entropy rate s(w) < s it 
holds that 
lim w™ (QE) = Tr(pMQM) =1. (7.155) 


n— Co 


Definition 7.3.3. The orthogonal projectors Qs) in the above theorem will 
be called universal typical projectors at level s. 


We subdivide the proof of Theorem 7.3.3 in various steps. 


Step 1 Let / €N and R > 0. Any Abelian quasi-local subalgebra C7° C Az 
constructed from a maximal Abelian @—block subalgebra Ce C Ae, together 
with the probability distribution w[C?° corresponds to a classical ergodic 
stochastic process. 

The results in [168] imply that, independently of the latter, there exists a 
universal sequence of projectors (corresponding to classical universal typical 


subspaces) py? E€ og C Am with 
1 n š n n 
z log Tr(p\")) < R, such that Jim, a! (py) =1 


for any ergodic state m on the Abelian algebra C?° with entropy rate s(7) < R. 
Notice that ergodicity and entropy rate of 7 are defined with respect to the 
shift on C7°, which corresponds to the ¢-shift on Az. 

One then applies unitary operators of the factorized form U8”, with U € 


Ap unitary, to the py and introduces the projectors 


we := VV Uep O e = A, (7.156) 
UcA, unitary 


These are, by definition, the smallest projectors such that, for all U, 


Ue ue" < ie . 


Let oe = Jaer ine x ne | be a spectral decomposition of one (with J CN 
some index set), and let P(V) denote the orthogonal projector onto a given 


(én) 


t : 
subspace V. Then, Wip can also be written as 


wer =P (span{u"| ig?) : iE I,U € Ay unitary }) . 


7.3 Quantum Spin Chains as Quantum Sources 397 
It proves convenient to consider the projectors 


Wee =P (span{ 4°" nee :ie€T,Ae Ac}) ; a < w f 
(7.157) 

Given m = nl + k with n € N and k € {0,...,@— 1}, let 

wee =w @l%*eAn, Wie =w o1 6A, 

These are projectors and, as in [160], one estimates the trace of wim) E Am 

as follows. By an argument similar to that used in the proof of Lemma 3.2.2, 

the dimension of the symmetric subspace SY M (Aen := span{ A8” : A € Ae} 

is upper bounded by (n + 1)%™“4®, thus 


Trw fe) = Twp’) Tr®* < (nt Tp < (atl) 29.2%. (7.158) 


Step 2. Consider a stationary ergodic state w on the spin-chain Az with 
entropy rate s(w) < s. Let £, > 0. If Z is chosen large enough, then the 


projectors ee where R := ¢(s + $), are d—typical for w i.e. 
Tr (00 wi") Silas, 


for m € N sufficiently large. This follows from the result in Proposition 7.1.9 

concerning the convex decomposition of the ergodic state w into k(@) < £ 
k(l) 

states wt 4 i ropa Wi 7 , that are ergodic with respect to the @—shift on 


Az and have an sate rate (with respect to the (—shift) equal to ¢s(w). 
Moreover, according to Lemma 7.2.1, for every A > 0, if one defines the 
set of integers Aga := {i € {1,..., k(O}: Sot) > &(s(w) + A)}, then 


t, 
these states enjoy the following property with respect to the von Neumann 


A 
entropy: jim acre = 0. 
Let C;,¢ be the maximal Abelian subalgebra of Ag generated by the one- 


dimensional eigenprojectors of the density matrices corresponding to wf E€ 
Ar. The restriction of w; to the Abelian quasi-local algebra C?9 > generated 
by Ci,¢ is again an ergodic state. From the properties of the Salting density 
and of the von Neumann entropy one derives the chain of bounds 


L. s(w) = s(w)) < sui IER) < SW Iie) = St) . 


Further, with A := B_ s(w ), ifi € Aj , one has the upper bound S(w 9) < R. 


Let U; € Ac be a unitary operator such that US” p} 7 Ute gm. For 
every i € Aj a, it holds that 


wk? (wht) = wh a a — 1. (7.159) 
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#(AG a) 


We can thus fix an £ € M large enough to fulfill — al 


ergodic decomposition to obtain the lower bound 


T in 1 ln tn ô P ln kn 
whe Vw! i n) > aD 5y wh" iw! TIe > (1- 5) min wie (wi?) : 


>1— 3 and use the 


Then (7.159) yields 


Step 3. One can now proceed as in [163] and introduce a sequence of inte- 
gers lm, m E€ N, where each fm is a power of 2 fulfilling the inequality 


LoS m< Umm (7.160) 


Let the integer sequence nm and the real-valued sequence Rm be defined by 
Nm = |Z], respectively Rm := lm (s+ $) and set 


Wm Enam) Ema 93-Lm 
KSim a 7.161 
oe Wenn @ id2"—!m"m) otherwise . ( ) 


Observe that 


4 log(nm +1) Rm 1 


1 
— log Tr QP < log TQ’ < 
m i i 


din 6fm +2 EEE = E 1 
= Ben] ST Bln 1” 


where the second inequality follows from (7.158) and the last one from the 
bounds on nm 


Pim 1< I oe OT 
lm bm 
Thus, for large m, it holds 
1 (m) 
— log TrQYY < s +e. (7.162) 
Pm ; 


By the special choice (7.160) of Zm it is ensured that the sequence of projectors 
(m) E Am is indeed typical for any quantum state w with entropy rate 
s(w) < s. This means that 1P cas is a sequence of universal typical 


projectors at level s. 
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7.3.2 Quantum Capacities 


The y-quantity in Holevo’s bound 6.33 limits the amount of classical in- 
formation that can be retrieved by a POVM measurement from encoding 
classical symbols i € I4 = {1,2,...,a} by quantum (mixed) states, i +> pi 
coming from a mixture p = J jer, Pipi € B} (HI) with a priori probabil- 
ities p;. In particular, the bound is in general hardly reachable; however, 
like in classical capacity theory, the amount of retrievable classical informa- 
tion per transmitted quantum state can be made arbitrarily close to the 
Holevo bound by means of suitable encodings of longer and longer strings 
i = iiiz in € I := I, x ---I,4. Also, the Holevo bound is a limit to 
—_—_-——” 


n times 
the classical information per letter that can be encoded into quantum states 


and retrieved with negligible errors. As we shall see, this state of affairs will 
lead to different definitions of quantum capacities. 
In order to prepare the ground for a detailed discussion, appropriate no- 
tations must be introduced. 
We spectralize p = Diver, T(a)| (a) )(r(a) |, set If := Ip x + + Ip, denote 
—>_E—-—_’ 


n times 
a = ajaz- an, a; € I,, and write 
r(a™) — J [ra , | a™ ) — Q | r(a;)) (7.163) 
j=] j=1 
pli j= = Pi, 8 Piz @+** Pin > Pa™) = Iv, (7.164) 


p= PE) oG™) = YI ra) a” (a | (7.165) 


imer almera 
On the other hand, the spectral decompositions of the quantum code-words 


pi = X plkli) |p(kli) )(p(kli) | , (7.166) 


ke J; 


with eigenvalues 0 < p(k|i) < 1 and eigenprojectors | p(k|2) ) (p(k) |, provide 
conditional and joint probabilities. 

Denote by A‘), respectively K”), the stochastic variables with outcomes 
i, respectively k™ = kar, t ki, € IR, where IR := Users J(i™) 
with J(é™) := x7_1Ji,. Finally assign to A™ and K) conditional and 
joint probabilities defined by Tm) Ar) = {PRE } mere ROE T(E)» 
where 


n) 


PERN) := TT] yl) (7.167) 
j=l 
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and by Tamyxm = {P(é, Boers were, where 
PUM, b™) := PUM) P(k p). (7.168) 


Shannon entropies are computed using (7.164) and additivity of von Neu- 
mann entropy, as follows: 


H(A™) =n H(A) =n X pj logs pi (7.169) 
i=1 
H(AM v K™) = H(A™) + 5 PUM) H(K™ |i) 
im) eI} 
=H(A®)+ X Pa) SG) 
im) eI} 
= n(H(A) + Y pi Sp) - (7.170) 
i=l 


According to the AEP (see Proposition 3.2.2), for any fixed £ > 0, we can 
distinguish a subset of 7 4(n)-typical strings iM e ui” a re) and a subset 
of Tamy«iny-typical strings (i, k™®) € yr) c ry x a. These subsets 
are such that 7 4) ( £) >1-—e and may pes (Ve) 
if &™ EUs” then 


> 1— e€. Furthermore, 


g-n(H(A) +e) < P(i™) < g—n(H(A)—e) ; (7.171) 
while if (1, k™) e VO”, then 


C> 2.1 PiS(os)+e) (AE fa piSos)-e) 


(7.172) 
As for classical capacity (see Section 3.2.2), we distinguish one more typical 


subset, Wi) c re) x 1, consisting of all jointly typical pairs, 4, k™) E€ 
(") where also i) €U™. From (7.168), (7.171) and (7.172), 


y” (sia)-x+22) s(p)-x-22) 


< P(k® 4) < gent (7.173) 


where (6.33) has been used and x is Holevo’s y-quantity for p and its decom- 
position p = }`;—; Pipi. Finally, in terms of (7.164) and (7.165), 


a) = SS PRM) [PEM EM) (PRM EM) | (174) 
R™MET Em) 

peas SO Pee Pe a Pee) atte) 
iln) erg 


k) Ee 7(4(™)) 
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where (see (7.166)) | P(k)|é) ) := Q7 | p(kyliz))- 

As showed in the proof of Theorem 7.3.1, for any € > O there ex- 
ists an orthogonal projector Me € B(H®”) that commutes with p8” and 
Tr(p®" Me) > 1 — £. Analogously, if the sum in (7.175) is restricted to WL”, 
one gets a positive operator n < p8” € BY (H®”) with 


Trin = `X P0, k()) 
(40) kew 


- ( Ss = 5 JPU, nm) >1—2e. (7.176) 


(40) OM EY a BOM) yey 
ar) gu 


Theorem 7.3.4. [136, 269] Let p = vic, Pipi € BY (H) provide a statisti- 
cal mixture of quantum states available for encoding symbols i € I4 emitted 
by a classical source; let x := x (p,{pipisier,). For any fixed 6 > 0 and suffi- 
ciently large n, there exists an encoding i™ + E(é™) = p(é™) on a subset 
Inc I% consisting of M strings and a decoding POVM 


(2%) > BO = { {wea |e) | 


iea 
an), k) yew) 
ui- Ð EEPE], 


Mer, 
an) kn) yew 


M 
such that | — — x| < ô and en < 6, where en is the decoding error 
n 
1 2 
en:=1- a > P(R EM) (G(R EM) | PROM EM) 
Mera 
kn) ega)) 


(7.177) 


The strategy of the proof consists of the following steps. 


1. choose an equidistributed set TyC I of M sequences i™ and to encode 
each of them by a density matrix €(é) = p(i™). The encoding thus 
provides the density matrix 


n 1 nC 
oe) =a 5 Eli™) 


imela 
1 n)je(n n)je(n n)je(n 
= NO PRM EM) | PRM EO) PRM i) | . (7.178) 
ea 


kn)egal)) 
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Choose random encodings E as in the proof of Theorem 3.2.3: the ar- 
gument does ensure that the required encoding and decoding procedures 
exist, but does not provide concrete instances of them (for a similar result 
see [144, 146]). 


. As regards the decoding protocol, the idea is to try to identify by 


means of (possibly of norm less than 1) vectors | W(k‘”)|i) ) only those 
P(k |) ) in (7.178) which are labeled by pairs (4, k\) € WS” 
after the same have been projected by H, onto the chosen high probabil- 
ity subspace of p™). All other | P(k)|é(™) ) will be made correspond to 
w(k™4™)) = 0. 


. The non-trivial decoding vectors are constructed as follows. (For sake 


of simplicity we shall denote multi-indices i, k™ as i and k. Set 
(i,k) ) := HMe| P(i,k)), consider the matrix S with entries 


Sein) ae) = (Bli, k) (DG, F) ) = (Pli) IPED), (7-179) 


where i,i € Ïa, k,k € J(i), J(i) and (i,k), (i,k) € WL”. This matrix is 
positive and its square root defines vectors |Y (i, k) such that [136] 


VS (in) aR) = (Pli) BEE ) = (V(kli) | PER) ) . 


Namely, Q~!/2| &(k|i)) where Q+? is the (positive) inverse square-root 


of Q := 5 | &(k|z) )( (kli) | (defined only on the range of Q), where 5 
(i,k) (i,k) 
denotes the sum restricted to the pairs (i, k) € we, 


. The error (7.177) is complementary to the ensemble fidelity (see Defi- 


nition 6.3.6) that has been used in Theorem 7.3.1. Using the previous 
definitions and that Q~!/? > 0, it can be bounded as follows: 


en Sap DL PED (1- (OID POW) 
Ket) 


2 - 
<2- o 2 PONV See - (Ien) 


icĪa (i,k) 


Proof of Theorem 7.3.4 Since 2(x— y£) < x—2? for all z > 0, the same 
inequality holds by substituting x with the positive matrix S in (7.179); 
taking diagonal values: 


3 le 
VIa) 2 F SGG Z 5 DY SHAG. SGA) - 
Ge) 


Then, using the first inequality in (7.173), the bound (7.180) can conveniently 
be recast as follows 
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3 * 
<2- Dd Dd Pli) Seria 


i€L 4 (ik) 


wv yee (kli) SkA) 


icĪ, (i,k) 
ET x+2e) f l 
+ 5 > P(t) PUJI) Sa, SG,0s(4,k) 


i jela (i,k)#(j;£) 


Suppose now to choose the M words ie] a randomly according to the 
probability distribution P(i); we obtain in this way a statistical ensemble of 
random codes and, as much as in the classical case, by averaging over the 
contributions of the randomly chosen i™ one eliminates the dependence on 
I, and remains with a sum over all i™ € Ii. Therefore, the average error 
can be estimated from above by 


<2-3 5 > P(i i) Slak)ilik) (7.181) 
ier (ik) 
N 
Lı 
eA Sri PU Sawik) (7.182) 
ier (isk) 
Loa 
M(M —1) z 
gnr(S(p)—x+2€) 
T M 


>o J POPO) Plkli) PUI) Sergo SG. .m) (7-183) 


jel? GHAGE) 


From (7.179) and (7.176), 
l= Tent) = e(a) = Tr((An m p®") Tz) >1-3e (1). 
Further, S¢,4);¢,6) < 1, thus Loa < 1 (2a). Finally, the last sum can be 


bounded from above by observing that if 0 < A < Band 0 < C < D, then 
by means of cyclicity under the trace operation, 


Tr(AC) = Tr(VACVA) < Tr(AD) = Tr(VDAVD) < Tr(BD) . 


Therefore, since IJ; commutes with p8”, Lop < Tr (aF): on the other 
hand, from the quantum AEP we know that the dimension of the subspace 
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projected out by Me is < 2"(S()+®) with eigenvalues < 2—"(S(/)-©), whence 
Lop < 2-(5()—82) (2b). Altogether, inequalities (1), (2a) and (2b) yield 


e% <2 — 3(1— 3e) + 1 + M2775.) = ge 4 277x-R-5e) | 


m — 


where we have put the growth rate R into evidence M = 2”F, The latter 
can be chosen arbitrarily close to the Holevo x quantity and still the average 
error becomes negligible with n — oo. Therefore, for any ô > 0 and n large 
enough there is an ic I with R > x— ô and epn < ô. 


Example 7.3.2. Suppose the sender encodes classical symbols 7 € I into 
states that she obtains by acting locally with unitary operators U; on her 
system in a state p12 E€ Ma, (C) ® Ma, (C) which she shares with the receiver. 
She selects the unitary operators U; with probabilities p;, and after changing 
pı2 into pi = Ui l2 p12 ut ®1 she sends her system to the receiver. The sender 
tries to maximize the information accessible to the receiver by optimizing the 
Holevo bound, thus seeking [71] 


Cu := ms- resto] > p=) Pip. 


Note that S (pi) = S (p12) for unitary transformations do not change the von 
Neumann entropy; in order to maximize S (p), consider the marginal states 


p) =Tro(p) , PP = Tri(p) = pa (= Tri (p12)) - 


By subadditivity (5.160) and (5.155), 
Cu <S (0) + S (p2) — S (p12) < log dı + S (p2) — S (p12) - 


Choose as unitary operators the d? Weyl operators Wa, (n) of Example 5.4.2 
with equal probabilities 1/d?; then, using (5.88) and (5.30), one gets 


1 l 
=g XO Wa(n) 8 zp Wa, (n) ® l = gee? 
n=(n1,n2) 
so that Cm > log dı + S (p2) — S (p12). This transmission protocol can thus 
achieve an optimal quantum transmission rate Cm = log di +S (p2)— S (p12). 


In general, like in classical transmission, the quantum states that have 
been used to code and transmit information are subjected to perturbing 
effects of the transmission channel which is being used. Concretely, if the 
the quantum code-words are projections P; € Ma(C) chosen with probabil- 
ities p;, thus making a statistical ensemble described by the density matrix 
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p = >¢;pi Pi, a probability preserving channel acts on them as a trace- 
preserving CP map A. In the light of the previous theorem, the channel 
capacity is defined by [145, 276] 


CulAl = mar [suw -= >n S cary} 


As much as for the entanglement cost (see (6.8)), in order to improve the 
capacity, one may consider n uses of the channel, thus a CP map A®” acting 
on states on Mj(C)®”" which may carry entanglement between different uses. 
Then one introduces the regularized capacity 


Cx[A] := lim 1 oy [Ay . 


n-+o n 


Such a limit exists because the capacity is superadditive; indeed, consider 
Cy[A1 ® Ag] and two statistical ensembles p ; PMY} and p ; PM} that 
achieve C'yy[A;], respectively Cyy[ Ao]. The additivity of the von Neumann en- 
tropy over tensor products states implies that, for the not necessarily optimal 
statistical ensemble, {pp op p% Q PIY, 


Cult: 4a) > S (alo) - D790? s (ar) 


+s (421p (2)] |) - 3 p S (4 regi) = CAC al - 


Were the capacity additive, the regularized capacity would coincide with 
Cy[A]. This is another important open question in quantum information 
which is actually equivalent to the additivity of the entanglement of forma- 
tion [276] (see also [43] for an approach to this problem based on the relations 
between the entanglement of formation and the the entropy of a subalgebra.) 


Bibliographical Notes 


A most exhaustive and complete review of the algebraic approach to quan- 
tum statistical mechanics is provided by [64]; in [65] one finds plenty of ap- 
plications to spin and continuous systems. A fully developed mathematical 
theory of the canonical commutation and anti-commutation relations, quasi- 
free states and quasi-free automorphisms can be found in [237, 200, 201, 202, 
255, 256, 257]. 

Quantum ergodicity and mixing are presented in [260, 277, 300, 107, 64] 
in increasing order of mathematical sophistication; the second reference has 
provided most of the material of this book concerning these topics, the first 
one concerning mixing. In [64] one also finds a detailed discussion of de- 
composition theory, while in [300] more recent developments are presented. 
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In [215, 221] what has been called mixing in this book is termed clustering, the 
qualification mixing being assigned to a stronger clustering behavior which is 
discussed in connection with Galilei-invariant interactions [218]. In [107, 300] 
one finds enlightening discussions about the physical meaning of the different 
algebraic factor types and of Tomita-Takesaki modular theory. 

Applications of quantum mechanics with infinite degrees of freedom to 
collective phenomena and thermodynamics can be found in [274], while 
in [290, 291] the emphasis is more on symmetry breaking phenomena and 
on the existence of inequivalent representations of the CCR and CCR with 
applications to physically relevant models. 

Quantum information related issues involving infinitely many degrees of 
freedom and the necessary mathematical tools like quantum compression the- 
orems and quantum capacities are discussed in [145, 239, 250]. For a review 
of different formulation of quantum capacity related quantities and their re- 
lations see [182]. 


Part IHI 


Quantum Dynamical Entropies and 
Complexities 


409 


The last part of the book first deals with two extensions of the Kol- 
mogorov dynamical entropy to quantum systems and with their applications. 
Then, it discusses some recent generalizations to quantum systems of classical 
algorithmic complexity. 


8 Quantum Dynamical Entropies 


The first part of this book has been devoted to illustrate some of the many 
properties of the classical dynamical entropy of Kolmogorov and Sinai; in par- 
ticular, it has been showed that it provides the optimal compression rate of 
ergodic sources (Shannon-Mc Millan-Breiman Theorem 3.2.1), while, through 
the positive Lyapounov exponents (Pesin’s Theorem), it measures the dynam- 
ical instability of classical dynamical systems; finally, it gives the complexity 
rate of almost all trajectories of ergodic dynamical systems (Brudno’s theo- 
rem 4.2.1). 

Several extensions of the KS entropy to quantum dynamical systems can 
be found in the mathematical and physical literature (see the bibliographical 
notes at the end of this chapter). All of them predated or were developed 
independently of quantum information; due to its rapid growth, the latter 
more and more appears as an ideal ground for testing the physical meaning 
and the technical usefulness of these proposals. 

One of the aims behind the attempts at defining quantum dynamical 
entropies was the possibility of classifying quantum dynamical systems, as 
much as it had been done for classical dynamical systems by means of the 
KS entropy (see Remark 3.1.2). Afterwards, the quantum dynamical entropies 
have been applied to the study of quantum chaotic phenomena and the quan- 
tum/classical correspondence; recently, they have been used to shed light on 
certain foundational aspects of quantum information, like quantum capacity 
and quantum algorithmic complexity. 

Of the various quantum dynamical entropies that have been proposed 
in recent years, we shall mainly focus upon two of them; namely, the en- 
tropy of Connes, Narnhofer and Thirring [88] (CNT entropy) and of Alicki 
and Fannes [9] (AFL entropy) +. These quantum dynamical entropies em- 
body two radically different ways of approaching the notion of information 
production in quantum mechanics; indeed, they may behave differently on a 
same quantum dynamical system. 

In Section 2.4, we have seen that partitions of the phase-space into 
finitely many, disjoint measurable atoms provide classical dynamical systems 
(X,T, js) (see Definition 2.2.2) with symbolic models: finite measurable parti- 
tions P = {P;}ier can be used to successively localize the moving phase-point 


"The L in AFL stands for Lindblad who introduced the notion in [194, 195, 196]. 
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within their atoms P; and to quantify the predictability of the dynamics via 
the information relative to the next time-step that is gained by observing the 
evolving system. 

In Chapter 3, partitions have been interpreted as POVMs taken from 
a commutative dynamical triple (L%°(4’),Or,w,,), where atoms have been 
identified with their characteristic functions and thus with orthogonal pro- 
jections summing up to the identity (see Definition 2.2.3.2). Thus, partitions 
P define partitions of unit (see Definition 5.6.1) and CPU maps Ey on the 
C*-algebra B(L7,(4)). However, because of commutativity, the action of E 
on Lv°(4’) reduces to the identity map; indeed, for all f € L7°(¥), 


x [fl(x) = $ xp,(2) f(@) xp,(2) = $ xr, (2) f(x) = f(@) . 


iE] tel 


The dynamics is thus insensitive to measurements, OroEx =ExyoOr = Or, 
as well as the states on L% (&): Fx [w,] = wu, where Fy is the dual CP map 
such that Fx[w,](f) = w,.(Ex[f]). 

Given a quantum dynamical triplet (A, O©,w), if one wants to extend the 
notion of partition to such a non-commutative context, a natural step is 
to substitute commuting projections with non-commuting ones or, more in 
general, with non-projective POVMs . Differently from the classical case, the 
CPU maps E : A+ A associated with them do not in general commute 
with the quantum dynamics, 90 E 4 © Æ Eo O, nor do the dual maps 
F preserve the quantum state, Flw] # w. Both E and F act as external 
perturbations; therefore, a preliminary question arises whether one should or 
not incorporate measurement processes into the very construction of quantum 
dynamical entropies. 

If the answer is yes, then, beside the dynamics itself, measurement pro- 
cesses themselves may act as a source of randomness; on the other hand, if 
the answer is no, the regular and irregular features of the dynamics refer 
to the system only, but are insensitive to the typical quantum phenomenon 
that getting information about quantum systems in general perturbs them. 
In other words, a perturbation-free quantifier of quantum dynamical random- 
ness might not measure the actual information production that always comes 
from observations of the time-evolving system; on the other hand, a quan- 
tifier of quantum dynamical randomness that takes into account acquisition 
of information through measurement processes would add the randomness 
coming from the latter ones to that proper to the quantum dynamics itself. 

Purpose of this and the last chapter is to convey the idea that, unlike 
in classical dynamics, randomness in quantum dynamics has more than one 
facet and that choosing one of the two answers above just means exploring 
two of these inequivalent aspects. 
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8.1 CNT Entropy: Decompositions of States 


The non-commutative algebraic structure which more closely resembles a 
commutative one is that of type IJ, factor von Neumann algebras A (see 
point 2 after Definition 7.0.8). Indeed, the state w which makes A a type 
II, factor is a normalized trace such w(X Y) = w(Y X) for all X,Y € A. 
The CNT entropy [88] generalizes to generic von Neumann algebras previous 
extensions of the KS entropy to type IJ, factors that were based on the above 
commutativity with respect to the state [107, 89]. 

The CNT entropy quantifies the information rate in quantum dynamical 
systems described by algebraic triplets (A, O©,w) and it does it independently 
of external measurement processes, by relying only on the algebraic properties 
of A, O and w. The basic idea of the whole construction is a clever use of 
the relation between entropy and relative entropy that has been discussed 
in Section 6.3.1 in relation to the entropy of a subalgebra (more in general 
of a CPU map: see Definitions 6.3.2 and 6.3.3). Before getting to that, we 
indicate why, in general, the steps that in Section 3.1 led to the KS entropy 
are not practicable in a quantum setting. 

Suppose M C A is a finite-dimensional subalgebra; if (A, O,w) is a clas- 
sical dynamical triplet, the KS entropy is constructed by considering 


— the finite partition corresponding to the subalgebra M; 

— the partition M™ = V O@*(M) generated by the time-evolved parti- 
tions {O"(M)}22); 

the Shannon entropy of the state w restricted to M™: H(w | M™); 


| 


1 
— the asymptotic rate lim —H(w}M”). 
noo n 
In the non-commutative context, by sheer analogy one might consider 


— any finite-dimensional subalgebra M C A; 
the finite-dimensional subalgebras M := Vics @*(M) generated by 
the n (finite-dimensional) subalgebras O*(M)?—9: 


the von Neumann entropy of the restricted state w |M ™: S (w M e 


| 


| 


1 
— the asymptotic rate lim —S (v M e), 


noo n 
However, this argument generally fails and the reason why it does fail is 
non-commutativity [301]: despite being finite-dimensional, the subalgebras at 
different times O*(M) need not commute and can thus generate an infinite- 
dimensional subalgebra M“ so that S$ (w M ™) cannot in general be con- 
trolled. 

In order to overcome these difficulties, the idea is to extend the entropy 
of a CPU map H, (7) (see Definition 6.3.3) to the entropy Hu (71, 72,---» Yn) 
of n CPU maps yi : M; => A from finite-dimensional C* algebras (that we 
shall always suppose with identity) M; into A. 
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Remark 8.1.1. As in Section 6.3.1, when M is a subalgebra of A, then one 
chooses as CPU map y the natural embedding tm of M into A. Therefore, 
when dealing with the set of subalgebras {01 (M as the n CPU maps are 
given by yj := O’ o tm. 


Like in the case of H, (y), we shall consider linear convex decompositions 
of the state w in terms of states w,(n). These states will now be indexed by 
strings i = iiz- in, ij € Ij, each CPU map q; being associated with 
a generic index set J;, carrying a total weight Mi”), Fixing ij € Ij, after 
summing over the all other indices and after renormalization, one obtains 
auxiliary decompositions of w associated to each 7;. Concretely, from the 
multi-index decomposition 


w= So Awww, I= hx Ip x-+-dp (8.1) 
MET) 
one obtains subdecompositions w = X ri, Wi, j=1,2,...,n, where 
ij él; 
som 
. y je 5 } 
wi, = yi Wiin) 5 ri, = Atn) . (8.2) 
i(n) ij i(n) 
ij fixed ij fixed 


Let A™ := A be the probability distribution associated with 


um) — 

the weights in (8.1) and A; := fa } ; the marginal probability distri- 
7) ijElIj 

butions consisting of the weights in (8.2). The generalization of (6.3.3) is as 

follows. 


Definition 8.1.1 (n-CPU Entropies). Given a C* algebra A equipped with 
a state w, let y : Mi C A, i = 1,2,...,n, be CPU maps from finite- 
dimensional C* algebras into A. Their entropy with respect to w is: 


Ho (41, Y2; -<3 Yn) (= sup [aao Sna) 


W=} jin) Az(n) y(n) j=l 


I5 D N S (ol w rs) (8.3) 


j=l ijel; 


= sup faa) -X` H(4;) 


W=} gin) Agim) Y(n) 


+¥-(se M) -= E xs (of mi) , (8.4) 


ijElj 
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where, with n(x) := —x log x, x € [0,1], 
=J nQim) , HM) = J na). 
i0) ijEL; 


As in Section 6.3.1, a concrete way to construct decompositions of w is 
to use the GNS construction based on the state w; it follows that the states 
Win) contributing to w = J jin) À; Wim) are in one-to-one correspondence 
with the positive elements of the commutant of Tu (A): 


Aim Wim (X) = (Ro | Vo Tul X) 2.5) , (8.5) 
where 0 < Yia) € Tu( A) and Liste) Y, jin) = 1. Also, if w is faithful, one can 


express the decomposing states in terms of 0 < X € Tu ( A)” by means of the 
modular automorphism ow (see (5.181) in Remark 5.6.1.3): 


Awi (X) = (Qu |04? (ToYi) TXR), (8.6) 


where 0 < Yyony € A and Jgn) Yz = 1. 
In analogy with (6.47), we shall denote by 


3 


Haina (Lee wien} := H(A™) — 5 H(A;) 
j=l 


+O siom), E7 


j=1 ijElj 


the contribution to the n-subalgebra entropy coming from a chosen decom- 
position of the state w. 


The n-CPU entropies enjoy a number of very useful properties that can 
luckily be proved without being obliged to know the optimal decompositions; 
the first one of these is a generalization of Proposition 6.3.4. 


Proposition 8.1.1. Given a C* algebra A, a state w on it and CPU maps 
y : Mj = A, j = 1,2,...,n, from finite-dimensional C* algebras M; 
(dim M; < d) into A, consider a decomposition w = J imer) u Wiin) 
(n) Ao w, a(n) where 
IM := Jy x Ja xX- -Jn is a multi-index set with card(J(™ ) tee ond and 
c and j™ := jijo- +- jn, Ik € Jy, such that 


{yg }5 me (n) {yi}Fe1 1 1 
H, A ain SA Nia) 3 Weta E. 
w im PE Smer) S I I J genye gin) i 


and e > 0. Then, there exists a decomposition w = X., 
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Proof: Following the steps in the proof of Proposition 8.1.1, consider par- 
titions Zj = = {Zi}. 1-1 Of the state-spaces S(M/;) into atoms Zi, such that 
if ZE are two states on M; belonging to any Ziy? then ||v, — və|| < 6. The 
cardinality n; of each of these partitions can be chosen not larger than the 
smallest number, r, of balls of radius < 6 needed to cover each S(M;), a 
number which depends on dim M; and thus on d. Given the decompositions 


= ` A ieee and aa we N E 


imer) 


one constructs the states 


AE 
F îk Jk f a k 
Wip "5 > X Wip Nin — ; l rin 
ikEIk jk ipely 
k k k k 
of Ezp wh ezh 
yr) 
4 i X in) 1 = } ` (n) 
Wj) := a Win) ; Ajo := A) ; 
im) ern) Jra in erln) 
wk EZE ,k=1,2,....n wk ezi, »k=1,2,...,n 
k` Ik “ik 


and the corresponding decompositions 


Ww = ) rv Ginn (n) and w = ) Nip, ik * 


JMET ÍkEJk 


Then, introducing the probability distributions A() := Be and 


um) oe 


A’ = {as } asset together with the respective marginal distributions 
jM EJE 


.— fk = ti 
Ar = LAF Jaen and Ay = {Ah Inen one estimates 


HE bone P — A = 
H(A™) — H(A’) +D (nu 4) — H(Ax)) 


+0 D ARS WE be & = >> ApS (wh, Pre w Pre) 


k=1 ikElk jkEJk 
KNE 


Indeed, according to the proof of Proposition 8.1.1, each term in the second 
sum over k is < €e, while the first line after the equality is < 0. This can 
be seen by considering the A, as probability distributions over partitions 
Pk with atoms Pr and A(”) as a probability distribution over the finite 
partition P™ := V$- Py. Then (compare Remarks 2.4.2), the Aj, and Aj, 
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are probability distributions over coarser partitions Qk < Pk, respectively 
Q) := VE 2k x P™, whence, using the conditional entropy (2.90), 


H(P™ v Q™) = H(P™) = H(Q™) + H(P™)Q™) 
A(PrV Qk) = H(Pk) = H(Qk) + H(Pr|Qe) - 


Thus, from Lemma 2.4.3 and Corollary 2.4.2, 


3 


n 


H(P™) — H(Q™) < X H(P,|Qx) = (HP ) — H(Qx)) 


k=1 k=1 


From proposition 8.1.1 it follows that for any € > 0, there exists a finite 
decomposition w = ) jin) yim) wel Wim) Such that 


{yj }j= 
Hs”? (2 w wir en) 2 Hu (11, 92)-+-5In) = NE , Ae 


while, from Definition 8.1.1, 


{yj }ja1 (n 
Hs (Y1; Y22---In) 2 Hy 7 (Lage wi Peer) ` 


We shall call these decompositions ¢-optimal and remark that their cardi- 
nality r := #(I™) depends on £ and on the maximal dimension d of the 
finite-dimensional C* algebras on which the y; act. Then, the n-CPU map 
entropies result equicontinuous in the maps 7; : M; +> A with respect to the 
topology defined on their linear space by the norm 


lly — Yella = sup 11 = V2) X) luo ; (8.9) 


IXI <1 


where ||X||?, := w(XŻX) for all X € A. 


Proposition 8.1.2. Let yj and Bee j=1,2,...,n, be CPU maps from finite- 
dimensional C* algebras M; with dim M; < d into A. Then, for any € > 0 
there can be found 6 > 0 depending on £ and d such that 


Ho (TiTe n T Ho (Tas Th) < nE 
when |y; = Gle < ô. 


Proof: Consider a finite decomposition of cardinality r as in (8.8) with 
e/2 in the place of ¢; then, 
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H (V1, V23- - -3 Yn) —H, (Jio Yar sa Ss < a = (a wen } ) ) 
g(r eI) 
n 
{ype nE 
HS SG (LAs 40 hne] + > = (s (wo qj) = S (woy) + 
j=l 


+ EM (50,0) -s(t E 


ij EI; 
Choose 0 < 6 < ðo and ôo such that the Fannes inequality 5.157 implies 
€ 
|S 11M) - S(@21M)| < = 
when M is a finite-dimensional C* algebra with dim M < d and v12 E€ S(M) 
are states on it such that ||vy — v2|| < do. Since |Jw(X)| < ||X]|. (see (5.49)), 


it follows that 


lwo(y-—yl= sup lwo (yy —¥)(M)| < -hle < 5 < ôo , 
MeM,||M||<1 


whence X(s (wog) -S wo) < ne. 


In oc to estimate the remaining sums, let us divide each index set I; 
into two disjoint subsets, namely 


o g 
B= {yen x2 Fh 
0 


and its complement. Since w = )> Alw, from (8.9), the state-based 


ijEl; 
distances are such that, for all i; € - 


ly — Wlej, i ra ly = Gle < 90:5 


so that D a A, (s (wo7yj) — S(wo %4)) < n 3 Finally, using that 
j=l ijEI9 
card(I;) < card(I™) < r and dim M; < d together with (5.155), one derives 


l , , §2 
> ri, (s (uf o y) -S (o, o %4)) < E logd . 
ij¢1) 
6? E. 
Therefore, 6 < ĉo such that ra logd < 6? yields 
0 


€ 
Ho (V1: V2- -3 In) — Ho (V-an) < N +35) = ne, 
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whence the result follows by exchanging the sets {7j;}7_, and {yj }7.1. 


Other properties that are important for applications to concrete quantum 
dynamical systems are the following ones. 


Proposition 8.1.3 (Properties of n-CPU Entropies). 

Given a C* algebra A equipped with a state w and n CPU maps yi : 
M; = A from unital C* algebras M;, j = 1,2,...,n, with dim M; < d, 
into A, it holds that: 


1. the n-CPU map entropies are positive and bounded, 


O< Hais tase. a) <X Hs (73) < SS (wory) < n logd ; (8.10) 


j=1 j=1 


2. the n-CPU map entropies do not depend on the order of their arguments: 


Ho (71:2; Guini Yn) =H, (Ir) Ya(2) aes Veta) ) (8.11) 


with n(i) any permutation of 1,2,...,n; 
3. the n-CPU map entropies are not sensitive to repetitions of their argu- 


ments: 


Ho (Y15 +++ W1 V Ws V+- Mn) = Ho (M15 9-15 V Vjt Yn) 5 
(8.12) 
4. if O: A— A is an automorphism such that w o O = w, then 


Ho (90%,00%2,---;90°%n) = He (915 V23- Yn) ; (8.13) 
5. the n-CPU map entropies are subadditive: 
Ho (Gir -3 Yp Vp+lo- e+ ie) < Hy (71: V2, miss , Yp) 
+H (Yp415 Yp+2:-- >Yn) ; (8.14) 


6. the n-CPU map entropies are monotonic under composition of CPU 
maps; namely, if Y : Nj; +> M; are CPU maps from finite-dimensional 
C* algebras N} into finite-dimensional C*-algebras M}, j = 1,2,...,n, 
which are in turn mapped into A by CPU maps y;, then 


Hy ("1 ogi Shanes Tn o Yh) < Hou (Yis Varsa) ; (8.15) 


7. the n-CPU map entropies increase by non-trivially increasing the number 
of their arguments: 


Ho (Yii Vares Yn) < Ho (Vir V2 ey nr Ynt) + (8.16) 


Before proving the previous properties we examine some simple cases 
where, in analogy with Example 6.3.6, the n-CPU map entropies can explic- 
itly be computed. 
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Examples 8.1.1. 


1. 


If in (8.15) one considers as CPU maps y; and yj the natural em- 
beddings of subalgebras N; C M; C A, then that property asserts 
that the n-subalgebra entropies increases under embeddings into larger 
subalgebras. Suppose the finite-dimensional C* subalgebras {Mj}? 
to be such that they together generate a finite-dimensional subalgebra 
M” := Vi ‘_, M;. This is the case, for instance, when each M; is a 


spin-algebra at site j on a lattice so that M” (") is the algebra of n spins 
at sites 1,2,...,n. Then, M; C M) whence property (8.15) together 
with property (8.12) and property (8.10) give 

Ho (Mı, Mə2,..., Mn) = Ho (MO, M, Mi) 


=H, (um) <s (v pa”) . (8.17) 


. Suppose A is an Abelian von Neumann algebra and {A;}"_, are finite di- 


mensional subalgebras generated by minimal projectors Gri 1- Then, 
consider the products Gim) := Gii Azis Onin» IM = x7_11;, where 
i™ = ijig? İn, ij E Ij and I; = {1,2,...,d;}. Because of E 
ity, these are projectors that one can use to decompose w as follows 


= >` Àj) Wiin), Wi gin) (a J= oa wlasna) VacA. 


ier) wâ) m) 
Further, the various probability distributions and elements of the subde- 
compositions amounts to 


x 7 . w(@ji;a) 
AM = {wai Jimeno , wl (a) = S 


j waju) j = {w(Gji;) Jier 


It thus follows that 1) the states w |A; = A; and the states wf [A; are 
pure states because the orthogonality of the minimal projectors implies 


he w(G;ji,@jx) 
wi (@jk) = Ce ) E Sisk 
J 


Since the minimal projections an generate the Abelian subalgebra 
A” :=\/"_, Aj, this yields 


aa Sse wm }) = S> Af) log a), = s (w pa) 


MET) 


Because of (8.17) this result is optimal; thus, 
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Hy (A1, A2,- .., An) =S(w pam) (8.18) 


Notice that the latter von Neumann entropy is the Shannnon entropy 
of the random variable Via A; distributed with probability A, where 
the random variables A; are distributed with marginal probabilities ⁄4;. 
The outcomes of these random variables correspond to the minimal pro- 
jections @j;; according to Section 5.3.2, via the Gelfand transform these 
can be turned into characteristic functions of the atoms of suitable mea- 
surable partitions. 

3. The result in (8.18) also holds when the A; are commuting Abelian finite- 
dimensional subalgebras of a non-commutative A, but w is the tracial 
state. Indeed, the minimal projectors of the A; provide an optimal de- 
composition as in the previous example. The reason is that the modular 
automorphism of the tracial state is trivial; thus, (8.6) yields 


Agemwyem (a) = (2, |04? (mas Go) Mola) IR) 
(Qu. | Tw (Gg(n) @) [Qu ) =W 


(Gin) a) . 


4. Suppose {M j}j=1 are finite-dimensional C* subalgebras that generate 
a finite-dimensional subalgebra M™ := V _ı M; C A. Further, sup- 
pose they contain pairwise commuting Abelian subalgebras A; C M; 
each belonging to the centralizer of w ? and such that the algebra 
AM = Viet A; they generate is maximally Abelian in M). Since 
the Aj; pairwise commute the products of their minimal projectors @;;, 
provide the minimal projectors Gj(n) of Ae By assumption, they are 
left invariant by the modular automorphism of w and can thus be used 
to decompose w as in the previous two examples. Then, from the second 
example above and from Example 6.3.3.2 one derives 


Ho (My, Mp,...,M,) > HS (10l), Wim) Jeera) 
=5 (w pam) = (w mM) . 
Therefore, the first example yields 
Ho (Mı, M2,..., Mn) = S (w| Mı V M2V---M,) . (8.19) 


Proof of Proposition 8.1.3 


1. Positivity comes from choosing not to decompose w at all, in which 
case the argument of the supremum in (8.3) vanishes. Further, the first 
line in the argument of the supremum equals minus the relative en- 


tropy (see (2.94)) S (a Kt) of the two probability distributions 


? They are therefore left pointwise invariant by the modular automorphism of w. 
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AM) = cl eee respectively A(™ := Ta Mb oero on the 
strings set of strings i’. Since the relative entropy is non-negative, the 
upper bound to the n-CPU map entropies follows from Lemma 6.3.1. 

2. In order to show (8.11), let w = Doyen) ese be an e¢-optimal 
decomposition such that, as in (8.8), 


{yi hfe (n) 


{yn(ahfa 
Since Ho 7” LT wa) }, ee where 7('")) denotes the 


string bee (1) bae(2) ors n(n) equals | i (Aws i(™) } ee) it fol- 
lows that 


{ra }fe1 ({ (n) \ ) 
< ‘ + 
H,, (1; Y2- Yn) < Ho AG Wri) imer) NE 
< Hv (Y0) V2)» tan ,Yn(n)) nE. 


Equality follows from the arbitrariness of € > 0 by exchange of the sets 
{7} j=1 and {7r0)}j=1: 

3. In view of (8.11), to prove (8.12) we show that H, (Y1, Y2,- <, Yn) does 
not change if the argument 7, appears twice. Consider an ¢-optimal de- 
composition for the right hand side of (8.12) and, according to (8.5), the 
corresponding positive decomposition of unity in the commutant 7(A)’, 
{Yie hier. Then, construct a new decomposition 


Y(n+1) ~ 
WwW = y Nyon Oy (n+1) 


GetVe (nth) 


based on a decomposition of unit is of r(A) > Y, mr i= Yyo IU, 
where j("F) = ijn +1; i™ e I™ and Jn+1 © Invi = fy. Then, 


Jk Uk 


MOD suf en = AMO, , Gok Vex n+l, 
while Ant = 1 and &"*! = w. Therefore, 
n+1 In+1 
poe (n) y ) 
os < H} 4 
Holi a a) SA Aie Yi rera NE 
2 = gp —1UYn (a f Fjern } 


< Ho (V ey. ea) ENE. 


+ née 


ee 


In order to invert this inequality, let 


y (n+1) 
w = Aint) Wi (n+1) 


qr) Ey(rt+1) 
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be an €-optimal decomposition for the left hand side of (8.12) and consider 
w= `> xO, Ww, s(n) 5 


jeJ 


where j™ = jija- jn with jy = ip for 1 < k < n — 1, while jn € Jn := 
In x In41 enumerates the pairs (inin41) so that J = I x-+-In_1 X Jn, 


xe, = MO), Gym) = wim. If 1 < k < n—1, it also turns out that 
x = A and oF = wk, while 
n _ (n+1) oa o 1 (n+1) 
Nin = X } Nery cs | eo yn X ) r; n+) WiN) + 
(n41) Jn i(n+1) 
invin 41 fixed in sin 41 fixed 


Then, A := {x0 = Ay for 


(a) hoew 


1 < k <n -— 1, whereas An = (Xp, k Finally, one can estimate 


= A+) and A, := {Ak} 


in el 


Heli tee a a) > HE (Taa) 


n—l1 


-Eua +> 35 NS (Go, wor) 


j=1 1; €1; 


pz > Nin (GP om, WO%m) - — (*) 


In€In 


m 


From (2.88) it follows that H(A») < H(An) + H(An41). On the other 
hand, with jn = (in, in+1), 


w oe Yn ~n wrt! = n gr 
Win 7 A2 À es Win , Dinti = yeti Nw jn 
tn inti “rel in 
and (6.31) imply 
n+1 
> ARS (Gorm, wom) >) DIAS (WE om, work) 
ÍnEJn k=nikElk 


Together with (*) this yields 


{75 }F=1U Yn (n+1) 
Ho (Y1: V25: ++» Yn—1, In) 2 Hay (AB wn Paroaria) 
> Ho (1V2: Yn mn) — NE. 


. Property (8.13) is a consequence of w0 O = w. In fact, given an £-optimal 
decomposition for the right (left) hand side of (8.13) and the correspond- 


ing positive decomposition of unit in the commutant, eae hence 
n el n 
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then {US Y% Usb A gur Ut hs ca ), where Uw is the uni- 
tary GNS implementation of ©, provides a decomposition for the left 
(right) side. 

5. In order to prove subadditivity, let w = J e) re be an ¢-optimal 
decomposition for Ho (71, Y2, - - -, Yn) and construct from it the following 
two decompositions: 


z 1 ot 7 2 2 
w= ` Ajwa w= Nya p+) Wki- 


j® k(n-p+1) 
where j® = iiiz: -ip € IH := hxhx: x Ip, while the indexes 
kort) — iptitpt2***tn € Japri Ipi X Ipp2 X +++ x In and 


(n) 


AÀ 
io ae 5y i) I an y (n) 
Wj) := i Wi (nm) ; Ajo) := Ayn) 


K(r—ptDep(n—pt1) © J) k(m—Pt) Ep (n—p+1) 
5) 
2 on im) 2 a5 (n) 
Wy(n—pt1) = ` y , Akopt1) — X ) Ai i 
jerr) ket) j ErP) 
. 1.— fyl 2._ fy2 
Since AY := {bun —— and A? := {Apm }pin-pinigzin-pt1) ATE 
marginal distributions of A™) = ene \, (2.88) yields 


H(A™) < H(A!) + H(A?) whence 
Ho (71: V2, a , Yp) + Hy, eis Vesa ae ia) > 


{v} i 1 
= Ay ( Ajon W500) “(p) 
GPE?) 


{Yj} Fep41 2 2 
+ A, {Ap wanes _ ie | ebinianesieeday 


{j= 
> Ho” 7 (Aw Pener) 2 Hy (Fis arsses en) =ne. 


6. Property (8.15) follows from the monotonicity of the relative entropy 
under CPU maps. 

7. Finally, property (8.16) is a consequence of (8.15) and of the fact that, 
given an ¢-optimal decomposition for the left hand side, one may con- 
struct a decomposition for the right hand side as in the first part of point 
3 above. 


Proposition 8.1.4. /216] Given a Cx algebra A equipped with a state w, 
let {yj} 721, Yn be CPU maps from finite-dimensional C*-algebras {M ;}"_, 
into A, then 
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Ho (71, V2; + ++) Yn—11 In) — Hu (9152) -- +) Yn—1s Yh) < Hw (Yn | 74) (8-20) 
Ho (u| y2) = sup H% ({Ai, wi hier) (8.21) 
w=) ier Aiwi 
H ({A;, wifier) : =A (s( (wio yi, WON) — 
icl 


-S (aote, woa) (822) 


Proof: Let w = J mwer A Win) be an s-optimal decomposition such 
that, as in (8.8), 


{7} (n) 
Ho (i Tarea in) L Ho T (100) ,w wb renin) anes 


then, according to (8.7), 


Ho (V1; Y2 -+ Vey Yn) — Hw (V1, V2; - -< Yn=1 Yn) L 


{yhje (n) 
< Ho i (A Peera) 7 


1H UY 
no ND uga + ne 
imer) 


< La(s wh om, wor) — S(wh oye, w072)) + ne. 
in€In 


Since n is fixed and e€ is arbitrary the result follows. 


Example 8.1.2. If A is an Abelian C* algebra and 71,2 are the natural 
embeddings of two finite-dimensional Abelian C* algebras A12 C A, then 
Ho (A; | A2) reduces to the conditional entropy of the random variables 
Aı associated with the minimal projections {@;}ier,, 4; =1,2,...,d, and 
{@2; tjer, Ig = 2,...,d2, of A12. These projections ave rise to probability 
distributions w 1A, = = {w(i}; and w [Ae = [unt 2 and can be consid- 
ered as the outcomes of A1, 2. Accordingly, the set of expectations w(G@1;@2;) 
corresponds to the probability distribution of the joined random variable 
A, V A2. Consequently, by using the minimal projections of A, to decompose 


w= 5 w(@u)wi, wila) := olaa) Vac A, 


tel, was) 


NON d: 
w(G1;G2;) \ i 
a 


it turns out that w;[A1 = {wi(G1z) = din Y] and w; | Ae = { = 
w(G1;) 


j=1 
Then, by means of (8.22) and of (2.91) one gets 
Hor? ({w(Gii),witier.) = H(A1) — H(A2) — H(A1) + H(A V A2) 
= H(A, V Ag) , 
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where H(Aj,2) := S (w |A1,2) are the Shannon entropies of the random vari- 
ables A; 2. We now show that no decomposition can do better; indeed, in the 
Abelian case at hands, decompositions of w correspond to partitions of unit 
with positive elements {ĉk}kex in A such that 


w= So (e)ur, wla) = NCES) aie A, 
keK w (Cr) 


Finally, Corollary 2.4.1 and strong subadditivity (see Proposition 2.4.1) yield 


HA1A2 ({w(Ck), WebreK) = H(A) = H(A, V C) = (Ag) -+ H(A2 V C) 
< H(A) — H(A, VC) — H(A2)+ H(A, V Ao VC) 
< H(A, V A2) — H(A2) = H(Aı|Aə) , 


where C, Ay VC and Aə V C are random variables with probability distribu- 
tions {w(Cy) trex, {W(Ce Gi) fier kex and {w(Cy G25) }jeIo,keK- 


CNT Entropy Rate and CNT Entropy 


Apart from the relation (8.13), all other properties in Proposition 8.1.3 regard 
n-tuples of arbitrary maps y without reference to the dynamics. Since the 
purpose of the CNT entropy is to quantify the information production in 
a given quantum dynamical triplet (A,O,w), we set yj := OÍ o y, where 
j=0,1,...,n— 1 and y: M+ Ais a CPU map from a finite-dimensional 
C* algebra into A. The first step is to ensure the existence of the rate 


lim =T (7,0 07,...,0t 07) ; 


no n 


This limit exists since (8.14) together with (8.13) yield 


Hw (y,A07,...,0" * 074) < Hy (1,0 07,..., 0107) + 
+ Hu, (OP oq, OPt1 oy... 071 oy) 
= Hv (7,0 07,..., 0t oy) F Hou (yO oy.jO" ? * 04) # 
Thus, one can apply the same argument already used to show the existence 


of the classical entropy rate (3.2) or of the mean von Neumann entropy in 
quantum spin chains: actually, 


lim Ti (Y,Ooy,..., 001 oy) = inf ŽE, (4,0 076.78" oy) ; 
(8.23) 
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Definition 8.1.2. Given a quantum dynamical triplet (A,O,w), where A is 
a C* or a von Neumann algebra and a CPU map y : M+ A from a finite- 
dimensional C* algebra M into A, the CNT entropy rate of y is 


1 
heNT (0,7) = lim —H, (7,9 07,...,0" 1074) , (8.24) 
n=œ n 


while the CNT entropy of (A,O,w) is defined by 


bo (O) = suphg™T (0,7) . (8.25) 
ey: 


There are a few properties of the CNT entropy that easily follows from 
the above construction. 


Proposition 8.1.5. The following bounds hold for the CNT entropy rate of 
a quantum dynamical triplet (A,O,w); given any CPU map y from a finite 
dimensional C* algebra M into A, one has: 


0 < hT (0,7) < Hu (7) (8.26) 
1 
LONT (Er, 4) < hT (8,7) <bENT(",y). (827) 


Proof: The bounds in (8.26) come from the properties (8.10) and (8.13) 
of the n-CPU map entropies. The upper bound in (8.27) follows from sub- 
additivity (8.14) together with property (8.13); indeed, since the limit (8.24) 
exists, one can fix N 3 n > 0 and compute 


1 
hoT = lim —H aaO 
» (0,7)= im ppe (pOor 0 o) 
< u lim or (o 07,0" oy or- D+ o 7) 
~ nkoto k os = , os 
1 ; 
= im 7H (7.6" oq... OPED 0 7) =h" (0”,7) . 


For the lower bound, first notice that n-CPU entropies remain unchanged by 
adding to the maps 7; in their arguments any number of CPU maps Vj from 
trivial finite dimensional C* algebras {cll;} 3 into A. Indeed, for any such 
CPU map H,, (7) = 0, thus by subadditivity, 


Eye ay Yasir nga) < Ha (i Yrsa) 


However, any optimal decomposition for the right hand side of the previous 
inequality can always be used to decompose w in the left hand side, too. This 


3That is algebras consisting only of multiples of an identity operator l; 
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decomposition can then be used to invert the previous inequality; then, one 
expands O1” o y into the set 


PD := for oy, Otil on, .., Intnl o7} , 


where y = yo y’ and y’ embeds the trivial subalgebra of M into M. Us- 
ing (8.15), one finally gets 


Sa (>, O” 07,...,E"F-D o 7) = THe (POr, a ay) 
1 j 
< nz Hw (7,0 07,...,08"1 04) ? 
whence the result follows by taking the limit k — +o. 


As much as for the KS entropy, one needs a means to avoid computing 
the supremum in (8.25). The structure of AF or UHF C* algebras or hyper- 
finite von Neumann algebras resembles that of classical dynamical systems 
admitting a generating partition (see Definition 2.3.5) Indeed, by using the 
continuity properties of the n-CPU entropies discussed in Proposition 8.1.2, 
one can prove a non-commutative counterpart to the Corollary 3.1.1 of the 
Kolmogorov-Sinai theorem 3.1.1. 


Proposition 8.1.6. /88] Let (A,O,w) be a C* quantum dynamical triple 
which admits a sequence of CPU maps Tj : Mj +> A and cj : At Mj 
from finite-dimensional C* algebras with identity into A and back such that 
limj—+00 ||7} ° 7;[A] — A|| =0 for all A € A. Then 


ne’? (6) = lim hT (0, r;) . 


j—+oo 


Proof: Let y: M+ A be any CPU map from a finite-dimensional C* 
algebra M into A; set yj := Tj o gj o y. Then, y;(M) — y(M) in norm 
for all M € M whence |y; — y|| — 0 when j — +00 for M is finite- 
dimensional. The same is true for the CPU maps O* o yj and OF oy, k > 0. 
Since ||O* o (y; — )||u < |O" o (y; — 7)||, Proposition 8.1.2 yields 


Tja (YO oyj., O1 oq) -H7 E 0%)| se 


for any € > 0 and j sufficiently large, whence 


lim bo” (0,99) = bo" (0,7) - 
J—> too 


Now, using the monotonicity property (8.15), it turns out that 
hSNT (0,7) = liminf hT (6, 7;) < liminf hON™ (6, 7;) 
j—+oo j— +00 


< limsup hONT (6, 7;) < hT (6) . 
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The result thus follows by taking the supremum over y. 


In case A is an AF or a UHF C* algebra, namely the norm completion 
of an increasing sequence of finite-dimensional C* subalgebras Mn, C A or 
matrix algebras M,,,(C) (see Remark 7.1.1), one chooses as CPU maps cj 
the corresponding conditional expectations and, as the CPU maps 7;, the 
natural embeddings ım, : Mn, > A [88, 89, 231, 232]. 

A similar result as in Proposition 8.1.6 holds for von Neumann quantum 
dynamical systems with A a hyperfinite von Neumann algebra (for the proof 
see [88, 222]). 


Proposition 8.1.7. Let (A,O,w) be a von Neumann dynamical triple, with 
A hyperfinite and generated by an increasing sequence of finite-dimensional 
von Neumann subalgebras {M k}ken; then, 


hONT (6) = lim hT (O, Mx). 
7 k— +00 ad 


Remark 8.1.2. When a quantum dynamical system under has the algebraic 
structure as in the above proposition, then the continuity properties of the 
CNT entropy allows to turn the lower bound in (8.27) into an equality [88], 
namely 

ho’? (6") = Inh (0) Ynez. 
Moreover, this result can be extended to a one-parameter group of automor- 
phisms {0;}er, that is [214, 222] 


ho’? (O4) = |thgN*(@) VtER, 
where O := O;,-1. 


8.1.1 CNT Entropy: Quasi-Local Algebras 


As seen in Section 7.1, in quantum statistical mechanics one often considers 
quasi-local algebras A which are generated (inductive limit) by local algebras 
Avy, indexed by finite volumes V, that are not finite dimensional. For instance, 
this is the case with Bosons in R° or with a lattice Z? with infinite dimensional 
Hilbert spaces at its sites; in such a setting one cannot resort to either of the 
preceding two propositions to compute the CNT entropy (8.25). 

Nevertheless, a quantum Kolmogoroy-Sinai-like theorem holds under the 
following physically plausible assumptions [232]; we shall consider dynamical 
triples (A, O,w) consisting of 


1. a quasi-local C*-algebra A which is the norm completion of [Jy Ay where 
the local algebras Ay associated with finite volumes V C R? share a same 
identity and are isomorphic to the von Neumann algebras B(Hy) 4; 


“In the following we shall restrict to R? for simplicity; the result holds in general 
for R? and Z*, d > 1 [232]. 
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2.if Vc Vs set V” := V’ \ V, then Hy: = Hy Q Hy, Av = Av Q Avr; 
3. the O-invariant state w is locally normal, that is w [Ay is a density matrix 
pv € By (Hy). 


Let iy denote the embedding of Ay into A; as shown in [231, 232], using 
the second assumption one can construct a family of CPU conditional expec- 
tations oy : At Ay such that ||ıy o ay[A] — A|| — 0 for all A € A when 
V 1 R°. Consider a CPU map y : M +> A where M is a finite-dimensional 
unital C* algebra and set yy := ty © oy o y. Now, limy;rs ||yv — 7|| = 0 for 
M is finite dimensional whence Proposition 8.1.6 yields 


Jim, BONY (9, 9v) = hg" (0,9) - 


From (8.25) and the previous equality one derives 


hSNT (6) = sup lim hONT (6, yy) < lim sup sup hONT (0, yy) 
ao VIRS VIR? y 


<limsup sup ho^" (O, 1v o Av) < ho™T (0) , 
VTR3 Av: MrAy 


where the second inequality holds for not all CPU maps Ay : M+ Ay are 
of the form of yy. Thus, 


noNT (9) = tim, sup hONT(6,yy) . (8.28) 
yv:M= Avy 


Fix a volume V C R? with local density matrix py = Y ri| ri )(ri | 
where the eigenvalues rj, are repeated according to their multiplicities and 


decreasingly ordered. Let pi) = rire |, QP = 1- PP and 


AP = PP Ay PP o CQ . 
The latter is a finite von Neumann subalgebra of Avy; consider the map 


w(  AOy) (k) 


o [A] := PP A PP 4 (om) V VAEAv. 
Wy 


It linearly maps Ay into AG y is unital and positive; further, if A € Ay and 
A®) >B= pi) zP + cB oy, with cg € Cand Z € Ay, 


oP [AB] = PP APP ZPP + cp E E 


Therefore, according to Definition 5.2.3, o% ), Ayr Al ) is a conditional 
expectation; then, for any CPU map yy : M+ Ay set Ty := ty o yy and 
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T) := iyo i) o of) o y, where AP) is the embedding of AP into Ay. We 


now show that, for any yy : M+ Ay, 
BENT (@,7v) = lim nN? (0.7) (8.29) 
—+o0o 
This lead to the result that, in order to compute the CNT entropy, one 


must essentially compute the CNT entropy rates of the finite-dimensional 
subalgebras projected out by the spectral projections of local states. 


Theorem 8.1.1. /232] Under the assumptions 1 — 3 on (A,O,w), 


nGX™ (0) = lim lim wENT (6,4) . 


Proof: Writing hONT o, Al”) = het (O, iyo iy and using (8.29 
w w 
and (8.15) one gets 


lim hONT (0,4?) < sup hT | O, y oyy 
eee) 


k— +00 yv:M= Ay 
Ty 
k k 
= sup lim sid O,ly o ih) o at o yy 


yv:Mm Ay E+ 


(k) 
Ty 


IA 


lim sup p NE (o, aT) 
n— +00 yv:M= Avy 


< lim net (o, ly 0 i?) = lim bo (0,4?) ; 
k—+00 k—+00 


Therefore, the result follows from (8.28) and the above estimates which yield 


sup hONT (Oyo qv) = lim hONT (o,a) f 
yv:M= Av k—=+00 


Proof of (8.29) We argue as in the proof of Proposition 8.1.2; we thus set 
yj = OF o Ty, yj = Of o r) and estimate the norms 


w(X!OI ory[M]) — w(X104 o rP [M]) 
w(X!) w(X!) 


a a 


> (*) 


where a short hand notation for (8.5) has been used and the X/ are positive 
elements of the commutant 7,,(A)’ such that $7, Xj = 1. 
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By writing Ay 3 X = (P®@ +Q) X(PE ae (™)) and setting v;(M) := 
w(X/@ o ty[M]) and v\*)(M) := w(X/03 o 7") [M]), one finds 


v,(M) — vP (M) = (O XQ wm] OY) 


A 
+w(O~F[X/]P iM] QY) + OXP iM] PE) 
et OE gee ee i a et EG eee 
B C 
(k) (k) 
m e Zev wiv) 
(k) 
x A 
D 
Since 0 < O-4[X!] =: Z € Tu( A)! commutes with the projections Y = 


P QË, one can write ZY = VZYVZ = /ZYVZY; thus, using the 
Cauchy-Schwartz inequality (5.49), one estimates 


A? < w(ZQY) w(ZQM w MREP mM] 
< W(X; (ZRP? MI? 

BP < (ZPP) w(ZQy [MPE wiMl@y’) 
< w(X4) w(ZQ) vl? M]? 

CP < u(ZP) (ZRG WIMPY wIMtQy) 
< w(X}) (ZPP)? M]? 

DP < W(X} o(ZQP) lvl? MI? , 


where (5.33) and Y < 1 => X'TY X < X' X have been repeatedly used; 
further, in the expression C, Z has been transferred to the right side of w(-) 
before applying (5.49). 

Since 5°, X/ = 1, these estimates obtain 


1 k _; k 
Lagal- vPI < 16 |lyvll Y AKG) = 16l? Z ri 
i a i j=k+1 
<e 


for and £ > 0 and k sufficiently large. Notice that the summands are w(X;) 
times the squares of the norms (*); we can now distinguish between the set 
I of those i’s such that the norms (*) < e!/3 and the rest I°. Then, 


e> >> uf oat} ai, (X!) 


tere ie I¢ 


_ |)? 


w(X!) 


implies that the total weight of I° is smaller than ¢!/?. As in the proof of 
Proposition 8.1.2, this can be used to show that, for any 7 > 0, 
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[HL (THe ety «1.87 o Ty) — Hy (neon wage ony) | < nn 


for k sufficiently large. 


8.1.2 CNT Entropy: Stationary Couplings 


In this section we reconsider the expressions (8.21) and (8.22) in the following 
algebraic setting: 


— avon Neumann algebra A with a normal state w; 

- an Abelian von Neumann algebra B with a normal state w, °; 

— the tensor product von Neumann algebra A & B equipped with a normal 
state w such that its marginal states are © |A = w and © [B= wu. 


Let A C A and B C B be finite-dimensional C* subalgebras; as CPU 
maps 71, respectively y2 in (8.21) we shall take the embeddings y; = cp, 
respectively y2 = va of B, respectively A into A ® B. We shall focus upon 
the quantity Hg (B | A): one has the following result [210]. 


Proposition 8.1.8. Let B be the random variable corresponding to the sub- 
algebra B and H,,(B) denote the Shannon entropy corresponding to the von 
Neumann entropy of the state w, restricted to B. Then, 


Hz (B | A) = H,(B) - S(w9w, A9 B, A8 B) . (8.30) 


Proof: Given the minimal projections {by jern, Ip = {1,2,...,d} of 
the finite dimensional Abelian algebra B C B and a convex decomposi- 
tion © = J jez AiWi, one can construct a finer decomposition of the form 
w= 5 AihijWij, by further decomposing w; = Sets HijWij, where the 
tEl;jelp 

states W;; on A Q B and their weights pi; are defined by 

b;b) ~ 
j) 


ne & 
Wijla b) := s 


( 
for all a € A and b € B. Since for alli € I 


> Pij = Wi(bj)  (*) 


wila 
Wj 


J 


a wi 

wi (bx) = Wi (bjbr) = Ojik 3 
the restrictions @;; |B are probability distributions Aij = {ðjk}kers with 
zero Shannon entropy. Then, using (8.22) and (6.23), one computes 


5 According to Section 5.3.2, the state Ww, corresponds to integration with respect 
to a suitable measure u and measure space ¥. 
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HE (ihi, Wig ier jers) = Hyu(B) — S (wl A) 
ae 5 ri [ig S (wi; | A) (xx) 


jEIsB 
HP Ai Di hier) = H, (B) — S (w M) 
+ Joa (3 (& A) - $ @:1B)) - 
icI 
It thus turns out that 


B,A ~ B,A 
Hz ({Ailij, ij bierjetn) — Hz” (Ai Wi fier) = - 5 Ailij log Hij 
t€l;j€lB 


vel 1€1jelp 


Indeed, since pijWij < Wi, the monotonicity of f(x) = logx as an operator 
function (see Example 5.2.3.9) gives 


5 big Wig [A log Wij A = 5 bijWig [A (log (mi Wij 14) = log 13) 


jEIs jEIsB 
s 5 hijWij lA (log wi fA — log 13) 
jEIB 
= 0; lAlog w; |A — 5 Hij Wig [A log Hij - 
jEIsB 


Therefore, after multiplying by the weights A; and summing over i € I, by 
taking the trace and considering that the marginal state w,; [A has trace 1, 
one finally gets 


XO Opus Big TA) <- ASA) JO Aii log pig - 
iE€I;jEIB ier iE€I;jEIB 


One thus concludes that in order to compute Hg (B | A) one can start 
with decompositions of © in terms of states of the form (*). Then, consider 
the decomposition © = ` -z VjWj, where 

jelig “Ij 


guong Nau, v;:=0(b;) acA, bEB. 
j ib) j j 
j 


Since S (w; |B) = 0, this decomposition contributes with 
HE’ ({Aj, Di }jers) = Ha(B) — S (wA) + X nS (lA) (+*+). 
jEIsB 


Further, notice that the decomposition appearing in equation (**) is such 
that 
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~ Aity = as 
So Aij = 45 = wy (bj), DOG =G,. 
icI icI 
Therefore, by the concavity of the von Neumann entropy (see (5.156)) 
S Ami Sig A) < J vs OM) , 
jEIB jEIesB 


whence decompositions of the form (* » *) are optimal. The proof is finally 
completed by calculating 


S(w®w,[A®B,GlA®B)= 
n(otaen, |B (logw lA 8 wu B — cata 8) ) zZ 


= -H,(B) - $(wtA) + Ð vy Tr(@; [A log (vj; 1A) = 


jEIs 


= Š vj S (©; lA) — SwA) . 


jEIs 


The previous considerations are useful in a different approach to the CNT 
entropy which was developed in [264] (see also [222]). We shall refer to the 
formulation used in [13] 


Definition 8.1.3. Let (A,O,w) be a dynamical triple with A a hyperfinite 
von Neumann algebra and w a normal O-invariant state; a stationary cou- 
pling to a commutative dynamical triple (B,0,w,,) where B is an Abelian von 
Neumann algebra, is any triplet of the form (A ® B,O ® 6,) where © is a 
O ® 6-invariant state such that w |A = w and {B= wy. 


The quantity Hg (B | A) and its expression (8.30) can be generalized as 
follows [210]. For any finite dimensional subalgebra B C B let 


Hs (B| A) := sup. La(s (tB, OIB ) = S (5: A, © A)) (8.31) 


W=X;0; i 


= S (w, ÌB) - S@MeB,weu,[A@B). (8.32) 


It then turns out [264, 222, 210] that 


noNT (9) = sup {niss (0,B) — Hs (B| A)}. (8.33) 
B,B,0,0 


where the supremum is computed over all possible stationary couplings and 
all finite-dimensional subalgebras B C B. Notice also that in the expression 
of the KS entropy, in according with Section 5.3.2, we have kept the algebraic 
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notation whereby B stands for a partition of a phase-space ¥, the automor- 
phism @ for a measurable, invertible dynamical map T : X + Æ and the 
state w, for a T-invariant measure p. 


Remark 8.1.3. The relative entropy as it has been used so far has always 
involved density matrices or restrictions of states to finite dimensional subal- 
gebra, whereas in (8.31), because of the presence of the generic von Neumann 
algebra A, it apparently works in a more general context. It turns out that the 
expression (6.23) for the relative entropy has a generalization to any unital 
C* algebra A and to generic positive, linear functionals (not even normalized) 
w1,2 on it [88, 222, 300]: 


t dt ae 


S (on, wa) = sup f T T4+t 


g - wn(ult)t uA) - oel Od] , 


where y(t) = 1 — z(t) and t+ z(t) € A is any step function with values in 
A vanishing in a neighborhood of t = 0. 


8.1.3 CNT entropy: Applications 


We start the presentation of various concrete applications of the CNT entropy 
by showing that in a commutative context it reduces to the KS entropy. 
Consider a classical dynamical system (Æ, T, u) that possesses a generat- 
ing partition P = {P;}?_, (see Definition 2.3.5) and its corresponding von 
Neumann triplet (M := L% (X), Or, wu) (compare Definition 2.2.4). In this 
framework, the partition P is identified with the finite-dimensional subalge- 
bra Mp generated by the characteristic functions yp, of the atoms P; of 


P. Furthermore, the partitions PË, := Viz- p T-I (P) (which generate the 
X-algebra of ¥ when k — +00) correspond to the Abelian finite-dimensional 
subalgebras Mp := Vg O} (Mp) generated by the characteristic func- 
tions of the atoms of P*, (these subalgebras generate M). We can thus 


apply the argument of Example 8.1.1.2 to deduce that 
Ho, (My, O@r(My,...,O%-1(My)) = S (on Me) -s5 (wy jagen) 
= Hier) : 


where we used that w, is Or-invariant and that 


n—-1 n+k—1 , n+2k—1 , 
MP = V oM) = V ohM)= an V Ə} (M)), 
=0 =—k j= 


together with (3.1). Thus, from Theorem 3.1.1, 
i (O, Mx) = hý” (T, P) = bhis (T) . 


Then, hONT (O7) = i (T) follows from Proposition 8.1.7. 
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CNT entropy: Finite Quantum Systems 


For finite-dimensional quantum dynamical systems, the C* algebra A is a 
matrix algebra M,(C) and the states density matrices p € Ma(C) with von 
Neumann entropy always bounded from above by logd. All these systems 
cannot support a non-zero CNT entropy rate, in agreement with the fact 
that their dynamics, given by a unitary U € Ma(C), is quasi-periodic and 
shows the behavior discussed in Remark 7.1.7, at the most. We shall prove 
this by considering the slightly more general scenario studied in [39]. 


Proposition 8.1.9. Let (A,Os,w) be a quantum dynamical system with 
A = B(H), w corresponding to a density matriz p € B} (H) with finite von 
Neumann entropy S(p) and invariant under the automorphism © such that 


B(H) > X + O[X] = į” Xe” , 


where the Hamiltonian H has a discrete spectrum. Then, h$N™ (O) = 0. 


Proof: Let P™) be the projector onto the subspace of H spanned by the 
eigenvectors relative to the first n decreasingly ordered eigenvalues of H and 
Q™ := 1— P™. Then A is generated as a von Neumann algebra by the 
increasing sequence of subalgebras A(”) := P(/™ A P™) ©CQ™. These sub- 
algebras are O-invariant; also, they diagonalize p for it commutes with H 
since w o O = w. Then, from Proposition 8.1.7, (8.12) and (8.10) it follows 
that 


hoNT (@) = lim hONT (0,4) = lim i (A™, AM Ae) 


n—+0oo n—+oo n 


ia =H, (4) 2 ii, s (p pan) =0. 


n—-+oo n n-+oon 


CNT Entropy: Quantum Spin Chains 


The algebraic structure of a quantum spin chain (Az, 9,,w) is such that we 
can apply Proposition 8.1.6. Let then {Aj_¢4}cen be an increasing sequence 
of finite-dimensional local subalgebras that generate Az, then 


hONT (6,) = Jim bh (O3, Ajeg) = jim ho (O3, Ana). 
where the second equality follows from the translation invariance of w and 


the property (8.13). Further, since OL [Ana] = Ansjety] C Ap,e+n-1] for 
0< j< n-1, using (8.15), (8.12) and (8.10) one derives 


Ho (Au, Qo(Atigq)s---, O37 (Apg)) < Ho (Apern—ys +++ Ap, t4n-1)) 
= Hy (Ap e4n-11) < S (Pretn—1]) » 
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where piije+n—1] is the local density matrix corresponding to the state w 
restricted to the local subalgebra Aj e+,—1)- By using (8.24) and Exam- 
ple 7.2.1.1 one thus concludes with 


Proposition 8.1.10. hONT (O,) < s(w) for any (Az, Oo, w). 


In order to show whether and when henT (O) > s(w), we use the fol- 
lowing strategy. Consider a local subalgebra Aj, 9) with fixed £ > 1 and set 
n= kl+p,0 < p< Lin (8.24); using Definition 8.1.2 and property (8.16) we 
get the following lower bound: 


hEN* (Oc) > lim hENT (05, Ana) 

Be 1 z 

> lim ET (Ana Oo lAng) -- - OPH An a)) 
2 k-1e 
> Jim, yl 1 (Aun Ana sO Ana) 
1 

= im, pole (Ap, Apesi,2¢9 ++» Ale—1)e4-1,4¢)) 

f Arna = 
> im aps ere nah v (Asoo su wh), (8.34) 


where we used (8.7) with w = J ;œ) Ayn) W,(e) any chosen decomposition 
adapted to the k commuting local subalgebras Ajje+1,(;-41)q- 


CNT Entropy: FCS States 


If a quantum spin chain is endowed with a finitely correlated state w as 
defined in Section 7.1.5, then its CNT entropy coincides with the entropy 
density s(w) (see Section 7.2) [133]. 


Proposition 8.1.11. Let (Az,O,,w) be a quantum spin chain with a FCS 
w, then hONT (6,) = s(w). 


Proof: Because of Proposition 8.1.10, the result follows if we show that 
h¢NT (6,) > s(w); for this we use the lower bound (8.34) and Remark 7.1.15. 
Therefore, we fix a local subalgebra Aj, 4; since altogether the arguments 
of the k-subalgebra entropy in (8.34) generate the local subalgebra Aj x4] 
we need consider the local state pjk. We start from a decomposition as 


in (7.101) with n = kl and regroup the indices 7 as follows 
(ke) oa : ; ; ; ; 
GO? = jija: je jerrjere tt jae Ih—erI(h—1ye+2 °° jee + 
a ee Ni, oa 5 


al i2 tk 
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Notice that the index j of the Kraus operators in the CPU map E defining 
the FCS w runs over a finite set J, whence j* e Ik, while each i, in the 


regrouped index i) = i112+++t, belongs to the index set I6; thus i”) € Ti; 

J 
We have seen in Example 7.1.15 that the weights p(j"®) assigned to the 
indices 7 (k£) give rise to a compatible family of local probability distributions 
wk) and that these define a global shift-invariant state w, over the classical 
spin chain ge, We then construct the decomposition w = J ce) Aj) Wi), 


where A, x) := p(t) and wj) = aie and use it to compute 


{Apea to _ (k) 
Ho {A;a , Wyte) } = 5 n(p(i j= 


iM- 

aM 
=. 
R. 
= 
rd 


tere, ijEI6 
J 
k k l 
+ 928 Mig—nerisa) -D D Pis (wi, Mig-nesa) » 
j=1 j=lijert 
where, with the notation of Example 7.1.15, 
;(k) ; 
m , pü) w a 
rae XO pa), wl = PIC) Pik) = Pia ° 
i(k) erk, i(k) erk, p J 
I I 
i; fied i; fiwed 


From translation invariance of FCS it follows that wfA(j—1ye+13q = Pu, 
‘ .(£) ihe . F 

wi, MAig—1)e¢i = Piae and p’ (ij) = pj) for some j € IS. There- 

fore, (8.34) reads 


hT (6,) > lim l > nine} -5 X nG) 


k— co 
jO EI, 


1 1 0 
jOE 
In the limit l — oo, the second term in the first line gives the Shannon 
entropy rate of the classical spin chain oa, Os,wr) as well as the limit 
k — oo in the first term, while the first contribution in the second line gives 
the von Neumann entropy density of (Az, Os,w). Thus, 


1 ; . (8) 
hENT (Os) > s(w) — lim 7 5 25) (Phin) i 


ł— oo : 
jOE 


The proof is then completed by using that, as discussed in Example (7.1.15), 
S (Ca) < 2log £. 
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CNT Entropy: Price-Powers Shifts 


The non-commutative shifts discussed in Section 7.1.5 offer an interesting 
variety of behaviors of the CNT entropy [13]. 

By construction the quasi-local algebra A, is generated by the Abelian 
ey 


algebra A, consisting of the orthogonal projections and by its images 
An := O? (A). These are also Abelian subalgebras, but in general they do 
not commute with each other; moreover, denoting by Mpp the subalgebra 
generated by Ay, £ = 1,2,...,k, these generate the von Neumann algebra 
Mg in Example 7.1.17.1. Therefore, one can compute h$NT (Or) by means of 
Proposition 8.1.7. Notice that, because of (7.122), the Mẹ can be represented 
as subalgebras of the spin algebras M2(C)®*; this fact allows to derive a 
bitstream-independent upper bound to the CNT entropy. Indeed, by using 
the properties in Proposition 8.1.3, one estimates 


Hy (My, O5(Mx), paty 9”"—"(M;.)) < Ho (Mn+k-1) 
< Ho (m (c)eett-D) = (n+ k— 1) log2 whence 
hoNT (0,, Mp) < log2 => hONT (0,) < log 2 . 


We discuss a few particular cases, a thorough analysis of the dependence of 
the CNT entropy on the bitstream being provided by [222]. 


1. For a bitstream g = 0, (m gO w) amounts to a classical Bernoulli shift 
and 
hit, (Os) =log2. (8.35) 
2. If g(n) = 1 for all n > 1, then M, describes a Fermi system on a one- 
dimensional lattice at infinite temperature (see Example 7.1.17.3), where 
pairs €2;, €2i+1 give rise to annihilation and creation operators aj, al ful- 
filling the CAR. Since n such operators generate an algebra isomorphic 
to Mə» (C), an argument similar to the one that gave the universal upper 
log 2 yields 


Ho (Mx, 0,(Mx), Risin ,O" *(M;)) < Hy (Mn+k-1) 
< Ho (ma (C)e i+- = = log2 whence 


log 2 log 2 
noNT (9,, Mx) < = = hoNT(9,) < A 


? 


where we have set [j/2] = j/2 for j even and [j/2] = (j + 1)/2 for j odd. 
On the other hand, (7.121) implies 


J= ’ = e] = j—-1©25 (= 
[e2 1€2j , €2k 1€2k| (ej 1€2;) (€2k 1€2k) x 
y (1 - (1920 k)+1|)+g(12(j—k) 1D) =O). 
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for all j,k > 1; therefore, operators of the form e2;-1e2; commute. Let A 
denote the Abelian algebra generated by ee (which is isomorphic to the 
diagonal matrix algebra D2(C)); then, the Abelian algebras {021 (A) = 
commute and generate an Abelian algebra isomorphic to the diagonal 
matrix algebra Dən (C). Then, using Example 8.1.1.1, one gets 


H, (A, @3(A),...,63"-1)(A)) =nlog2. 
Finally, (8.27) yields 


BONT (8) > RENT (@,A) > GHENT (63 


whence 


log2 
BENT (0,) = = 


(8.36) 


. In the case of an asymptotically highly anti-commutative Price-Powers 
shift [220], for any W; E€ A, there exists an infinite set I(i) of integers 
such that 


{orwi], OPW} = Win Wirm + WirmWh,, =0, 
for all n,m € I(t). We shall show that 
hol (0,)=0. (8.37) 


In order to do that, we shall consider a stationary coupling of (Mg, Oo, w) 
to commutative dynamical triple (6, 0,w,,) (see Definition 8.1.3 and the 
preceding discussion), namely a triplet of the form (A&B, 080, ©) where 
wis a O, @6-invariant state such that © [A = w and © ÌB = w,. Let p € B 
be any projection; then, 


{Ww}, 26], Wi 0p} = {wi Wi} @ 6"[p1e™[p] = 0 
for all n,m € I(t). As done in Example 7.1.17.5, by setting 
1 N 
i=1:n; EI (i) 


for an arbitrary N € N, one estimates 
Ewe p| =o <y. 


Since N is arbitrary, we deduce that (W; & p) = 0 for all W; E€ M, and 
all p € B which means that the global state factorizes: w = w ® wp. This 
fact in turn implies that the relative entropy contributions in (8.32) vanish 
so that Hg (B | A) = S (w, |B) for all finite-dimensional subalgebras B C 
B. Since he? (0,wu) < S (w, |B), the result follows from (8.33). 
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CNT Entropy: Quasi-Free Bosons and Fermions 


For quasi-local algebras of Bosons and Fermions in translation-invariant 
quasi-free states as those considered in Example 7.2.1.3, one can consider 
the discrete space-translation group © := {On}nezs (see Example 7.1.2.1) 
and enlarge the scope of Definition (8.1.2) to cover the fact that there are 
now three directions along which the n-CPU entropy (8.3) can increase. 
Given a CPU map y : M+ APF from a finite-dimensional unital C* 
algebra into the Fermi, respectively Bose quasi-local algebra, a natural way to 
proceed [233] is to consider, for each k = (k1, k2, k3) € N3, the parallelepipeds 


B(k) = {ne N° l 0<nj<k,i=1,2,3}, 


CPU maps of the form On o y and then to replace (8.24) by 


1 
CNT — : 
his (0,7) = i ee ki kaka 9A ({On o Y}neB(k)) G (8.38) 


while keeping the definition of (8.25) for the CNT entropy of ©. Notice that 
the limit in the right hand side of (8.38) exists because of the subadditivity 
property (8.14) and the assumed translation-invariance of the quasi-free state 
wa together with property (8.13). 

With the same technical assumptions ensuring the result in Exam- 
ple 7.2.1.3, by means of (8.1.1) it can be showed that the CNT entropy of 
the space-translations coincides with the mean entropy [233]: 


hon (©) = oy 1 „dk (nRalk)) +71- Ralk))) (Fermions) 
N (0) = ane dk (n(Ka(k)) — (1 + Ka(k))) (Bosons) . 


Remarks 8.1.4. 


1. Quasi-free automorphisms in the Fermionic case have been considered 
in [214, 289]; the following result holds [223]: let f,g € Lig 5,)(dp) be 
single particle wave-functions for a Fermi system on a lattice ([0, 27] 


being the momentum space). Consider 
Oy(a*(f)) =a” (Uf), (Uf)(p) =e f(p) ; 


it defines a quasi-free automorphism over the CAR algebra with single 
particle energy w(p) assumed to be a real absolutely continuous function 
of the momentum variable p. Further, let 


2m 


w(al f)at (g))= | dp plp) f* (p) g(p) 


0 
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define a quasi-free Oy-invariant state over the system, with 0 < p(p) < 1 
a measurable one-particle distribution over [0,27]. Then, 


27 


ho^” (Ov) = A dp |w" (p)| (n(p()) + nl = plp))) 


where w’(p) := dw(p)/dp is the group velocity. The physical interpretation 
is suggestive [214]: for a quasi-free automorphism the dynamical entropy 
production as described by the CNT entropy amounts to a flux of single 
particle Fermionic entropy governed by the group velocity. 

2. While in one-dimensional quantum dynamical systems the 1/n factor 
controls the asymptotic increase of the n-CPU map entropies, this is no 
longer true in higher dimension. An instance of this fact is the previous 
result where one divides by volumes in order to avoid divergences due to 
the freedom to move in more than one direction. In general, that is in the 
case of the time-evolution in dimension > 2, it turns out that h$NT (O) = 
+o0; this problem arises already on the classical level and a possible way 
out is to consider space and time translation together [153, 39]. 


8.1.4 Entropic Quantum K-systems 


In Section 3.1.1 it was proved that the algebraic structure of classical Kol- 
mogorov systems introduced in Section 2.3.1 can be characterized by means of 
the dynamical entropy rate. In particular, from the proof of Theorem 3.1.3 it 
emerges that the equivalence between the existence of a K-partition, namely 
property (1) in the theorem, and the entropic properties (3) — (6) hinges 
upon property (2) that is the triviality of the tail of all finite-dimensional 
partitions. 

Algebraic quantum K-systems have been introduced in Section 7.1.4 as 
generalizations of classical K-systems; in this section, we present an entropic 
characterization of non-commutative K-systems that partially mimics that 
given in Theorem 3.1.3. This gives rise to a class of quantum dynamical 
systems with particular clustering properties, but in general not K-systems 
from the algebraic point of view. 

We start by considering the relations (3) — (6) in the above mentioned 
theorem and study how they are affected if one substitutes the n-subalgebra 
entropies for the Shannon entropies. For sake of simplicity, we shall restrict to 
the case of AF algebras A (see Remark7.1.1) so that we can consider finite- 
dimensional subalgebras A C A as arguments of n-subalgebra entropies, 
namely, we take the natural embeddings ¿4 : A — Aas CPU maps y. Also, 
we shall restrict to faithful states w so that the only subalgebra with 0 entropy 
with respect to w is the trivial one (see Lemma 6.3.1). 
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Theorem 8.1.2. Given a quantum dynamical triple (A,O,w) with A an AF 
C* algebra and w a faithful state, let {cl} C A denote the trivial subalgebra 
and consider the following statements 


1. the CNT entropy is strictly positive, namely for all non-trivial finite- 
dimensional subalgebras A D A # {cl}, 


hONT (@, A) > 0; (8.39) 
2. for all finite-dimensional subalgebras A D A # {cl}, 
lim hONT (@", A) = Hu (A) ; (8.40) 


3. for all finite-dimensional subalgebras A D A # {cl}, A D B and all 
sequences {jk}ken of positive integers, 


lim Jim inf[H, (B,6"*(A),...,6"**(A)) 


n—+00 k—--+00 


-H, (BE Age) =H, (B) ; (8.41) 


4. for all finite-dimensional subalgebras A D A # {cl} and all sequences 
{jk }ken of positive integers 


Vani mint [Hu (B, Orti: (A), ee ek A) 


n—+co k—++00 
-Hu (B,@"*7(A),..., A" +9 (A))| =0 = B= {c1} . (8.42) 
They stand in the following relations 


(8.40) => (8.39) 


to. 
(8.41) => (8.42) 


Proof:  (8.40)== (8.39) Because of the second property in Lemma 6.3.1, 
one can choose 0 < e < H, (A) and n such that, using the lower bound 
in (8.27), 


Ho (A) —e€ 


hONT (6, A) > ior (e", A) > >0. 
n 


(8.41)— (8.40) Consider the sequence {jẹ = (k — 1)n}k>1, from the as- 
sumption it follows that for any £ > 0 there exists ng € N such that, for all 
n > no, 


lim inf [H. (A,@"(A),6"*"(A),...,0"*(A)) — 


k—+00 
-Ho (6"(A), 67"(A),0°"(A),..., e"*(A))| = 
= lim inf [Hu (A,@"(A),...,0"*(A)) — Ho (A, @"(A),..., en(A))] 


k—+00 
2 Hy (A) SE 
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where property (8.13) has been used in the first equality. Further, since 


liminf = sup inf it follows that there exists pọ € N such that, for all p > po, 
k—+00 p>0 k2p 


A, := Hy (A,@"(A),...,0"(A)) — Hy (4,6"(A), M ena) 


> lim inf [He (A,@"(A),...,*(A)) 


k—-+oo 
-H, (A, @"(A),.. Tae -e > H, (A) — 2. 


Choosing p > po, one thus estimates 


i= 


1 — H, (A 
-Ho (A,0"(A),...,6"-D(A)) = = 4; + Hy (A) 
P Po p 


ACOE E P — Po 
e 24 + =R (Hu (A) — 2e) , 


Since the left hand side of the first inequality is always smaller than Ho (A) 
(see (8.26)), the result follows from the arbitrariness of € > 0 by letting 
p— +o. 

(8.41) > (8.42) This follows from property 2 in Lemma 6.3.1. 

(8.42)= > (8.39) When A # {cl}, then, the same argument used to 
show that (8.41)= > (8.40) implies that, for any € and sufficiently large 
n, h$NT(@”, A) > e. Thus, the lower bound in (8.27) yields 


1 
Ly (9; A) > hors (9”, A) 2 >0. 
n 


31% 


While in a commutative context the above relations are equivalent, there 
are no proofs so far that they are such also for quantum dynamical sys- 
tems. Also, the classical versions of relations (8.39)— (8.42) are equivalent 
to K-mixing which is the strongest possible way of clustering. If one wants 
to relate the behavior of the CNT entropy to the mixing properties of the 
dynamics, among the possible choices, (8.40) appears the more appropriate. 
Indeed, one knows that hONT (O”, A) < H, (A); this is due to the fact that 
the dynamics usually create correlations between past and future. Therefore, 
if, asymptotically, the equality holds as in (8.40) this means that for large in- 
tervals between successive events the system is affected by memory loss [216]. 


Definition 8.1.4 (Entropic Quantum K-Systems). /215] A quantum 
dynamical triple (A,O,w) is called an entropic K-system if, for any CPU 
map y: M |= A from a finite-dimensional algebra M into A, it holds that 


Jim hg? (6',7) = He (7) - 
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This choice is also convenient because the behavior of hSNT (@", M) is 
often more informative on the properties of the dynamics than hCNT (O). For 
instance, if (A,O,w) is an entropic K-system, then 


jim Ha (1, 6"(M),...,"*—-)(M)) =kH,(M) , (8.43) 
for all k € N and for all finite-dimensional subalgebras M C A. Indeed, 
from (8.23) it follows that 


heN" (6%, M) < 7H, (M,0"(M),...,6%-9(M)) < Ho (M) , 


whence the result follows by taking the limit lim:.4.0. 

The asymptotic behavior (8.43) is an effective expression of the memory- 
loss properties of the dynamics of entropic K-systems. Comparing the contri- 
butions in (8.3) to the n-CPU entropies, one sees that, for large t, the optimal 
decompositions w = 4c) AM, wim» for Hu (M,O*(M),...,0%*-)(M)) 
must be such that 


[Jim (H H(A®) -F Hn) :)) =0 (8.44) 
lim, ON, 5 (ohh OÏ (M ), w1O”(M)) =H, (M), (8.45) 


ijel; 
where A) = ON, J 008 Azgi= [Al ati and 


(k) 
j (k) j N 1 40st 
ijt = » Noo 4g Wie = >D j 


ilk) ij fixed ilt) ij fixed aj yt 


In fact, the difference in (8.44) can at most vanish, but never be positive, while 
each of the summands in (8.45) is bounded by Ho (M). This also means that 
the optimal decompositions for large n must be such that the corresponding 
sub-decompositions are close to be optimal for H,, (M). 


Example 8.1.3. Let (Ag,O,,w) denote a quantized hyperbolic automor- 
phism of the torus, where A is the C* algebra generated by the Weyl- 
operators Wọ( f), with @ =< a?s >, s € N; these quantum dynamical systems 
are norm asymptotic Abelian (see Example 7.1.12). By using the exponen- 
tial decay (7.81) of their commutators, we shall show that they are entropic 
K-systems [209]. 

Let y : M + Ag be a CPU map from a finite dimensional algebra M 
into A. Fix € > 0 and choose k such that 
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Ho (7) 2 HaT (0; q) es ii (y, Oho. Oko) =E 
1 k—1 
z z(aļaP) E H(Aja)) (8.46) 
j=0 
1k l 
+ TR (ul 0 @% 070 , woyo ) f (8.47) 
j=0 i; 


Notice that w is the tracial state; thus, its modular operator is trivial whence 
its decompositions can be chosen of the form (see (8.6)) 


w(Xj A) 
=a Ari 5 w,(A) = WX) 


for A > X; > 0 such that >), Xj = 1. Because of the norm-density of the 
Weyl operators within Ag, given £ > 0, we can reach 


p 
Ho (y) < SOAS (wiog, wog) te (8.48) 
i=1 


by means of a decomposition w = }7?_, Aiw;, where 


P 


Xj = Wolfi) Wolfi), XO Wolf Wolf) =1, 


i=1 

with functions such that f¥(—n) = fi(n). Moreover, we can arrange them 

in such a way that f;(n) = 0 for all 1 < j < p—1 if ||n|| > K, while 
|Wo( fill < €. 

As a trial decomposition for H,, (7, OF Oee a) , consider the pos- 
itive operators 


ilk) t 


ij 
X oe ba 1 Y) where 


YE) = Wolfio) BIW (a) 0 [Wo Si) > 


ilk) t 


where i) € R. They satisfy 3a» Xi = 1; further, 
j k 
Xit = X 
ilk) ij fixed 
j t 
= ei ( L (Yal) Xi; Vaya) (8.49) 


A E 


Yayain] = OK [Wol fua) e Oe? W (Fna) (8-50) 
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Using the tracial properties, the coefficients of the convex decompositions 
w= Doe AL tb, ot are 

AÍ = (XE 4) = W(X) =A (8.51) 
Therefore, in (8.46), H(A;,.) = H(A) for all 0 < j < k—1, where, in terms 


p 
of n(x) = -zlog x, H(A) = 5 n(A;). Therefore, 


k 1 k 
(BAP) - 0 AAs 2) = ZHP) - H(4) (8.52) 
In order to control the probability distribution AP consisting of the coeffi- 
cients AA p we expand 


Wolfin) = X. fi. (nr)Wo(nr), ny E€ Supp(fi,) - 


Nr 


By means of the Wey] relations (7.29), they thus read 


k-1 
k k * Ti nrp Tr 
Auman = E e m eres 
r=0 


no: MR-1 
Mo- Mk—1 
k—1 
x w(Wo es Hn; - m;))) where 
j=0 
k—1a-—1 
Plin} {m} = D> Y (o(B" ns, B™ ng) — o(B™ m, B”*ma)) 
a=1 b=0 
k—1 
w(Wo > Bi (ny = m;))) = Öse Bit(n;—m,),0 ? 
j=0 


where B = A? is the transpose of the dynamical matrix A. By expanding 
|n; —m,;) = y| a+) +6,| a_ ) along the eigenvectors of B, one gets 


k-1 k-1 
(© gja) las) + (© ya) la-)=0, 
j=0 j=0 


where a > 1 ad a7! are the eigenvalues of B, whence 


k-2 k-1 


w- + X ya OT =0=8 4+ Soya. 
j=0 j=l 


8.1 CNT Entropy: Decompositions of States 449 


Suppose 1 <i; < p—1 for all ij € i), then the f;, have compact support and, 
for t sufficiently large, the above equalities imply y,_; = 0 = do. Iterating 
this argument, one gets nj = m; for 0 < j < k — 1, whence 


k-1 k-1 
k 
R= YS [DA mr = ea). 
Tg2.-7K=1 T=O J= 


a p-l1 pe 

Then, Peona =k Sons), where >> denotes the sum over i; # p 
i=1 

for all 0 < j < k — 1. Therefore, 

1 k 1 k k 

HUP = 2 H(A, ) > EE ey AW.) = H(A) — 10) - 

qk) 

Furthermore, from the assumptions, A, = w(Xp) < £; thus (8.52) can be 
estimated as follows 


k-1 

1 

z(a) — H(A;,)) > eloge. (8.53) 
j=0 


In order to lowerbound (8.47), we first rewrite it by means of (8.51) as 


k-1 ý 
LEENS (wi, 40% 07, wor) = S (wo) -X AS (wi 07) 
j=0 ij i=1 
k-1 
HEE DA‘ (S (es 07) -5 (uh. 00%09)) 
j=0 ij 
Secondly, by means of (8.48), we lowerbound it by 
k—1 
Holy) -E+ =p Soy (s (wi,07) -S (ha 0 0f o 7)) . (8.54) 
j=0 ij 


Thirdly, we consider the expectations 
j j j 
w(x}, 0" [Wa(9)]) = 5 ol iaasa) Xi, Yiisrie 1 Vo(9)) 


= w(Xi; Wo(g)) + 5 w( (Yia, ri Xi; [Yisant Wo(9)] ) , 


iji ik= 


and expand the commutator as 
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k—j—1 
[Foi ik—1]? Welg al= 5 oi [Wo( fia) bias 
a= 


OLY Wal fisra-1)] [ORW Fizsa)] + WOO) OL Wo iga) 
OY Wal aah 
Since Wolfi) Wol fi) < ei Wo(f;)Wo(f;) = 1, then ||Wo(fi) ll < 1; there- 


fore, using (7.81), one can estimate 
=j= 


X New [Wol fis+a)] Wao) | 


k—-j-1 


Tat T td htd] Cam - 


a=1 Ta,ile 


IA 


[ionam] 


IA 


Consequently, if all functions have compact support, the commutator goes 
to 0 exponentially fast with n; now, the function fp not necessarily with 
compact support is in any case such that ||Wo(f,)|| < £ and any element in 
y(M) can be approximated in norm within £ by suitable Wo(g) where g has 


compact support. Therefore, one can adjust n so that | wl a oy—wi; ox] <e, 
whence (8.54) and the Fannes inequality (5.157) yield 


k-1 
D (wi 0007, wor) > Ha (y) — E€ + Ale) 


j=0 4; 


where h(e) — 0 when £ — 0. Together with (8.46), (8.47) and (8.53), this 
last estimate proves the result; indeed, for all y : M > Ag and e€ > 0, one 
can choose t large enough so that 


Ha (7) = i (04,7) > Hu (y) + e loge — 2e + Ale). 


The previous result puts into evidence the role of asymptotic commuta- 
tivity in establishing the existence of a memory loss mechanism. One won- 
ders whether the vice versa is also true, namely whether asymptotic memory 
loss implies asymptotic Abelianess and to which degree. The following result 
whose proof can be found in [42, 222] gives a partial answers to this question. 


Proposition 8.1.12. Let (A,O,w) be a quantum dynamical triple with A a 
hyperfinite von Neumann algebra of type II, equipped with the tracial state 
w. Then, this dynamical system is strongly asymptotically Abelian. 


The following corollary regards quantized hyperbolic automorphisms of 
the torus with rational deformation parameter 0 = p/q which are algebraic 
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quantum K-systems and expresses the generic inequivalence of this notion 
and the one of entropic quantum K-system. 


Corollary 8.1.1. The quantized hyperbolic automorphisms of the torus with 
rational deformation parameter 0 = p/q cannot be entropic K-systems. 


Proof: As seen in Example (7.1.13), these quantum dynamical systems 
cannot be strongly asymptotically Abelian. 


8.2 AFL Entropy: OPUs 


The quantum dynamical entropy developed by R. Alicki and M. Fannes [9] is 
based on an earlier approach of Lindblad [194] to the non-commutative gen- 
eralization of the KS entropy and considers the description of C* quantum 
dynamical systems (A, O, w) by means of quantum symbolic models. In anal- 
ogy with classical symbolic models (see Section 2.2), the time-evolution © is 
coarsely reconstructed by means of a shift automorphism O, on a quantum 
spin half-chain Ay (see Section 7.1.5) equipped with a particular (unlike 
for classical dynamical systems, in general not translation-invariant) state 
wy. We shall denote these quantum symbolic models by quantum dynami- 
cal triples (Ax, O©s,wx), where the subindex ¥ denotes the fact that they 
are constructed by means of operational partitions of unity (OPUs) in a way 
that can be physically interpreted as corresponding to repeated measurements 
performed on the system (A, O, w). 


Definition 8.2.1 (Operational Partitions of Unity). An operational 
|Z| 
i=? 


partition of unity in A is any finite collection of operators Z = {Z;} 
Zi € A, such that 
|Z| 


21; (8.55) 
i=1 
where |Z| is the cardinality of Z. 


OPUs correspond to the POVM measurements typical of quantum infor- 
mation (see Definition 5.6.1); as already observed, they are the most general 
algebraic extension of the notion of classical partitions to quantum systems. 
Furthermore, we shall see that OPUs can profitably be used instead of par- 
titions even in a classical context. 

Given two OPUs Zı = {Zi and Zo = {Za 221, the algebraic ex- 
tension of the notion of refinement of two partitions (see Section 2.2) is as 
follows 

Zi O Zə := {ZiZa} |22] . (8.56) 


i, j=1 
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What one gets by this definition is a finer OPU; indeed, 


|Z1| |Za| |Z2| 
5 Zi, zi, Zii Z2j = 5 Zi Zo = 1. 
aj=1 j=l 


Moreover, consider a measure-theoretic triple (¥,T, p) and the correspond- 
ing von Neumann algebraic commutative triple (L7°(¥),Or,w,). One can 


associate to two finite, measurable partitions P = {P;}!7| and Q = 1055 


of the measure space ¥ the OPUs Zp and Zg from Lo (X ) consisting of the 
characteristic functions yp, and xQ, of their atoms. It then turns out that 


ZpoZgq={x an aang l corresponds to the refined partition P V Q. 


Definition 8.2.2. Given an OPU Z = {z} C A, its time-evolution at 
time t =k € Z under the dynamics O is defined as 


|2| 
i=1 ' 


Z* := O*(Z) ={6"(Z,) (8.57) 


Further, Z\ will denote the OPU 
zn) = Zo O(Z) One e"-1(zZ) = {Zim Fam ea (8.58) 
where Zim = O” (Zin 1) O(Zix) Zig (8.59) 


and 23) 34 = ioir -++in—1 with ij € {1,2,...,|Z]}- 


Note that Z* is an OPU follows since © is an automorphism of A: 


|Z| |Z| 


YV o(z)'O(Z:) = o5 z$ z) =O(1)=1. 


i=1 


Then, 2) is also an OPU . 


8.2.1 Quantum Symbolic Models and AFL Entropy 
Given an OPU Z = {Z; ar let {| z; yzl denote a fixed orthonormal basis 
in the finite-dimensional Hilbert space C!#!. The |Z| x |Z| matrix 


Ma (©) > pl2]:= $D | %)( z; | o(ZÍZ:) (8.60) 


is a density matrix. Indeed, normalization comes from Definition 8.2.1, while 
positivity is ensured by the fact that 
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|Z| 


(leZ ly) = So dd; w(Z4Z;) = wl Zy) >0, 


$= 


where Cl4l > |4) = 4 yil ai) and Zy = DE| ot Z 
Furthermore, from w o O = w, it follows that 


plZ*] = pZ] VteZ. (8.61) 


Consider the time-refined OPU 2); the corresponding density matrix is 
of the form 


Myz\(C)®” > plZz™] = 5. | zi (zjm | w( Zim Zim) 5 (8.62) 
im jer 


where 
[zim ) = | za) @ | Zig) Q| Zin) - 


At each iteration of the dynamics O, one component is added to the OPU Z™ 
and one factor to the corresponding algebraic tensor product Mjz\(C)®”. 
Therefore, to any given OPU Z C A there remains associated a quantum spin 
half-chain Az (see Section 7.1.5), with a |Z|-dimensional spin at each site and 
a family of density matrices p [Z™], n € N. Since O is an automorphism, 
applying Definition 8.2.1, it turns out that these density matrices form a 
compatible family in the sense of (7.85), namely 


Trin41} (of"+)) = Tren) ( 5 | Zg(nt1) )( Zy(m44) |w (Zi un Zaen )) 


i(n+1) 
g(rt+t) 
= 5 | zi EAG | (Zinga (Zinga pw (Zh nin Zaar) 
i(n+1) 
g(rt+1) 
|Z| 
= a | zim )( Zj) | 2 (zi Taa (Z; Í Zi) Zim ) = plz") ; 
ur) jo 


Thus the family p[Z], n € N, provides a state wz over Az. 

The dynamics O on A corresponds to moving right along Az with the shift 
automorphism ©,; however, unlike the states of quantum spin chains (see 
Definition 7.1.11) which are 9-invariant, the compatible family {p[Z]},en, 
need not satisfy condition (7.86); namely, in general, wx o Os # wx. For 
instance, in general, 


|Z| |Z| 


TrjolZ®] Y lza) alae (Z}9(Z}, Za) Ze) # elZ] - 


i1, j1 =1 
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In the classical setting the KS -entropy is the maximal Shannon entropy 
per symbol over all symbolic models built upon finite measurable partitions. 
The AFL construction defines the quantum dynamical entropy of (A, 0,w) 
as the largest mean von Neumann entropy over all its symbolic models 
(Az,9>,wz) constructed from OPUs Z chosen from a selected 0-invariant 
subalgebra Ag C A. Because of the lack of translation invariance, it is not 
guaranteed that the mean von Neumann entropy of (Az, Os,wz) exists as a 
limit. 


Definition 8.2.3 (AFL -Entropy). Let Ag C A be a O-invariant subalgebra 
and let Z C Ao be an OPU ; set 


1 
hôF™! (9, Z) := limsup —S CE) , (8.63) 
where S' (o[2™]) is the von Neumann entropy of the density matrix asso- 
ciated with the OPUs 2™. The AFL -entropy of (A,@,w) is then defined 


as 


hAFL (9) := sup hAFL (6, Z) . (8.64) 
0 


When needed, we shall explicitly refer to the dependence on Ao by writing 
he” (O, Ao) s 


As for the CNT entropy (compare (8.27)), when one considers powers 0%, 
q = 0, of the automorphism O, one has the following bound. 


1 
Proposition 8.2.1. For all N > q > 1, it holds that —hAF™ (0%) > hAF™ (0). 
q 


Proof: For any given OPU Z = {2}, set 
BO) = GD 2] 0 0Ha], 


Given the OPU 2, q > 1, one verifies that (Z(9)(47) = z(a”), Therefore, 
writing n = kp + q with 0 < q < p, by means of (5.161) one gets 


saatna) 


1 
hAFL (O, Z) = limsup —S (pz) = lim sup 
n 


n= k=œ kq+p 

li 1 (ka) li E (ak) 
< -limsup —S ( p[Z“]) = =- lim sup —S | p[Z\%"] 

qd k-00 k q k-oo k 


1 AFL (67,2) f 
q 


In fact, since the states p[Z™] are density matrices on a spin-algebra 
Miz\(C)®” = Qio (Mz\(C));, one derives the bound 
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|s (oz) — s (of2])| < 5 (oraa) < 4 108121 , 


where Pix q,kq+p] is the marginal density matrix on QE (M a (C));. Since 


OPUs of the form Z are a subclass of all possible OPUs, one concludes 


1 
hy (O) = sup hô™™ (O, Z) < —h (6%) . 
ZEAo0 q 


Remarks 8.2.1. 


1. Suppose the dynamics is trivial, O = id4, namely O[A] = A for all A € A; 
from the previous result it follows that, if hêF* (id4) > 0, then it is 
infinite for one can choose an arbitrarily large q and id% = idy. This effect 
is clearly due to the perturbing action of the OPUs which themselves act 
as a source of entropy. Therefore, the O-invariant subalgebra Ag from 
where the OPUs are taken has to be chosen in such a way to minimize 
these perturbing effects. 

2. The request that OPUs consist of elements from a selected O-invariant 
subalgebra Ap C A usually comes from physical considerations. Indeed, 
OPUs as POVMs should correspond to physically realizable measurement 
processes which are always strictly local, namely they should consist of 
operators from local subalgebras of A. The obvious choice for Ao is thus 
the *-algebra containing all strictly local C* algebras. 

3. Unlike for the CNT entropy (see Remark 8.1.2), it is not known whether 
an equality of the form hAF (67) = qhAF" (O) holds. Indeed, the key 
ingredient in the proof of this equality for the CNT entropy is its strong 
continuity which is not usable in the case of the AFL entropy. Continuity 
is also important to check on its dependence on the OPUs : for results in 
this direction see [10, 133]. 


8.2.2 AFL Entropy: Interpretation 


Like the KS entropy, one can interpret the AFL entropy as the asymptotic 
rate of information provided by repeated, coarse-grained observations of the 
time-evolution; the difference from the classical setting is in that a coupling 
to an external ancillary system is required. This can be seen by going to the 
GNS construction (HL, Ts 2.) based on the O-invariant state w. 
Consider an OPU Z = {Z,;}21, the pure state projection onto 
|Z| 
Ho @ CF! 3 |Z) := XO nol(Z:)| 2u) Q | zi) 
i=1 


456 8 Quantum Dynamical Entropies 


has the following marginal density matrices (see (8.62)) 


|Z| 
pr = Troz| 22) (22| = > To(2:)| Ro ){ Ro Ms( ZZ) 
=: FZ[|25)( Ro |], (205) 
|2| 
pur = Ten, (18282 |) = Yo (2o | mo(Z}Zi) (Ro) |) (271 = ol] - 


The first marginal state is a mixed state on H, resulting from a POVM 
measurement (see Definition 5.6.1) on the GNS state | 2u )( Qu |. This effect 
corresponds to the action of a map which, in the GNS representation, is the 
dual of the following CPU map on A: 


|Z| 
A> Ar EzļA] =Y ZÍ AZ. 


i=l 


Since |W¥)(W¥| is a pure state, Proposition 5.5.5 ensures that p[Z] and 
FS [| 2. )( Qu || have the same von Neumann entropy. The same argument 
applies to the case of the refined OPU Z(): the von Neumann entropy 
S (p[Z™]) equals that of 


Fil RoN l= J TZ) RoN Ru Toli), (*) 


;(n) (n) 
i EQ) 


with Z,(m) as in Definition 8.2.2. Using the GNS implementation of the dy- 
namics, 7,(O(X)) = Uin,(X)U., and the fact that U.,| Ru) = | 2), one 
rewrites 


Tw (Zim) | Qs) = UE ts Zig 1 UE)? ") + Uns ts (Zi, Ub Tol Zio) Qu ) 
= U2 (Ulm (Zins Uha (Zinn) + Uditis Zio) | Qu), 
whence, setting U,,[A] := U„ AU}, A € A, (*) can be recast as 
Feel] Ro Mul] = UZ o (UL oFZ) [QAI (8.66) 
It thus follows that 
S (Egem ll Qu )( Qu I) = S (olZ™]) - (8.67) 


Therefore, the AFL entropy can be regarded as the largest entropy production 
provided by POVM measurements based on a selected class of OPUs and 
performed at each tick of time on the evolving system coupled to a purifying 
GNS ancilla. 
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Because of this fact, while the CNT entropy which corresponds to the 
maximal compression rate of ergodic quantum sources, the AF'L entropy ap- 
pears to be related to the classical capacity of quantum channels. We shall 
show this after providing examples of quantum dynamical systems where the 
AFL entropy can be explicitly computed. 


8.2.3 AFL-Entropy: Applications 


As for the CNT entropy, we first ascertain whether the AFL entropy reduces 
to the KS entropy when the algebraic dynamical triple (A, O,w) describes a 
classical dynamical system (¥,T, u). 

As already remarked, classical partitions P = {R}, of ¥, are associated 


with OPUs Zp = {xp,}!7\, then 
ES Be eee onip {o""(xp,,_, JO" (xp, de XPa} 


~ f xrm (Pi, OTP, aP} {xew } i 


so that Z™) is the OPU associated with the refined partition P™®) 6 and 


plz™®] = 5 [zim )( Zjm | H (Pym) Pic) 


i(n) g(r) Ca) 
wd El Zp] 


XO [amm | u (Piw) 


(n) (n) 
i El2izp] 


is diagonal with eigenvalues ju (P;(n)) so that (see (3.1)) 


a 


s (ozp) == Ð MPi) log (Pym) = H, (P) 
Meo 
; 1 (n) \ _ pKS 
lim sup =5 (zt J) = hS (T, P) . (8.68) 


However, in view of the fact that one is free to choose more general OPUs than 
those arising from classical partitions, Definition (8.2.3) may in general lead 
to an AFL entropy of (¥,T, u) which is larger than its KS entropy. Actually, 
this is not the case: in order to prove it let us consider a generic OPU given 
by a finite collection F := { AAI of essentially bounded functions such that 


|F | 


AP = 1] L3(4) . 
i=1 


°The notations is that used in (2.40) and (2.39). 
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The corresponding density matrix (see (8.60)) reads 


|F | |F| 


A= D aal f uO E GOD 
- | dines (8.69) 
x 
with {| Pa an ONB in Cl! and 
IF| 
Mis (©) > Pele) = BN, e) =D Ala) a). 610) 


Notice that, because F is an OPU, the Pr(a) are projections onto normalized 
vector states. If an OPU results from the refinement of other OPUs , then 
the associated density matrix is a continuous convex combination of tensor 
products of projections of the form (8.70). Concretely, if F = Fio Fao- +- Fn, 


Q Miz, (C) 2 lA = f dule) Prs(2) @ Pale) 8 Pra). (871) 


Using this expression it is possible to prove that, without restrictions on the 
OPUs taken from M := L% (X), the AFL entropy of (M, Or, wp) coincides 
with the KS entropy of (4, T, u). 


Proposition 8.2.2. ie (Or, M) = he (T) (see Definition 8.2.3). 


oe a P = {P;}¢, be any finite measurable partition of VY with 
= Te o Lame its dynamical refinement up to time t = n — 1 and let 


F be any other OPU from M. Given F™ as in (8.58), let 
Prim (2) = Pr(2) ® Pri(2) 8- Prai(a). (+) 


Since the atoms of P™ are disjoint, one can decompose p| F™] into a convex 
sum of other density matrices in Mz (C)® ”: 


pF |= [ Au (@) Prole) = JO MPR) Pum 


imer 
: du (2) Pron (2) 
Pr = TT HAT FT). 
HEN) POA 


Thus, the concavity of the von Neumann entropy (5.156) and the triangle 
inequality (5.161) implies 
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S (oF) <- D MPR) og uP) + DO w(PL}) Sie) 
Me™ ME Q™ 
= HP®) + So (PL) S (pgm) 


imen 


<H,(P™) + > S e)s (Pi) 6%) 


imeR™ J= 


1 
where, from (*), pk := — | du (x) Prr (x). 
pF) YPE 
Let Qr € MjF\(C) be a projection; since S (Qx) = 0, the Fannes inequal- 


ity (5.157) implies 
S (Ppr) < llek — Qili log |F] + nllr — rll) - 


Notice that each pp € CIF! is a continuous convex combination of pure state 
projections Pran) (x); the partition P is arbitrary and can always be chosen 
in such a way that each pẹ stays sufficiently close to a projection Q; so that 
the right hand side of the previous inequality can be upperbounded by a 
quantity independent of n and i. Consequently, dividing both sides of (+) 
by n and taking the lim sup obtains 


1 
haf (Or, F) < limsup —H p(P™) =H (T, P) < h¥S (T) . 


n—+00 


On the other hand, from (8.68) one gets 
håFL (Or, M) > sup hê! (Or, Zp) = h¥S (T) , 
Zpell 


where IT is the *-subalgebra of M containing the OPUs Zp arising from all 
possible measurable partitions of ¥. 


Evidently, one would like to reach the KS entropy by computing the AFL 
entropy on a smaller set than the whole of M = L?°(4). The search for 
a suitable «subalgebra My C M starts with the introduction [10] of an 
entropic distance between two OPUs Fi, C M defined by 


ALF Fa] := S (p[Fi o Fa]) — S (p[F a) . (8.72) 


Some useful properties of the entropic distance can be extracted by in- 
specting more closely the consequences of (8.71). Indeed, it turns out that, in 
a commutative context, the entropy of a composite OPU is invariant under 
permutations of the constituent OPUs : 


S (p| Fi © Fz 0 F3]) = S (p| F2 0 Fi 0 Fs) , (8.73) 
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for all OPUs Fı 2,3 C M. In fact, because of their tensor product form, the 
density matrices p[F, o Fy 0 F3] and p| Fz 0 Fı o F3] are unitarily equivalent. 
Also, (5.161) yields 

S(p[Fi © F2)) < S(plFi]l) + S (elFa)) , (8.74) 


for all OPUs Fı2 C M. Indeed, by partial tracing p| F1 o F2] over CIF], 
respectively C!72!, one gets 


Tra(o[Fi 0 Fal) = I du («) Pr, (2) Te(Pr,) = AFA] 
Tio 0 Fel) = I djs (#) Te(Pr, ()) Pr,) = oLFa) - 


A more interesting property is the following one: for all OPUs Fi, C M, 
S (p[F1 ° F2]) > S(p[Fi)) - (8.75) 


To prove this, consider the case in which the integration measure in (8.71) 
- a discrete probability distribution u = {p,;}4_,, that is p[F; o Fy] = 


2 pj Pr, (j) ® Pr, (j). Then, construct the density matrix 


d 


pra = >> p li Xil 8 Pr) @ Pr (i) , 
j=l 


where {|7)}4 j=1 is an ONB in C4, where the labels denote the factors from 
left to right. Notice that 


a 


p2 := Tr13(p123) = pi Pri (j) = olFi] 
j=l 


a 


pı2 = Tr3(p123) = 5 pilil D Pr, C) 


d 
p23 = Trı (p123) = >) p; Pr, (3) @ Pr, (j) = plFi 0 Fo] . 


Then, strong subadditivity (5.162) yields (8.74): indeed, because of Re- 
mark 5.5.5 and of the fact that Pr,(j) projects onto a normalized vector 
in CIl, it turns out that 


S (p123) = S (p12) ~~ does log p; . 
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Finally, probability distributions given by regular Borel measures u can be 
approximated by discrete ones; thus, the inequality (8.75) can be extended to 
generic u [10] by means of the continuity of the von Neumann entropy (5.157) 
(notice that all the density matrix considered act on a Hilbert space of fixed 
finite dimension). 


Lemma 8.2.1. Given OPUs Fı 23 C M, the entropic distance satisfies 


ALF |F] > 0 (8.76) 
AlOr[F]l Orl F] = AlFi|Fo] (8.77) 
A[Fi © Fal Fs] < AlFil|Fs] + A[F2| F3] (8.78) 
ALF lF 0 F3] < ALF |F] (8.79) 
ALFE FSO] < nAlFi|Fa] - (8.80) 


Proof: Positivity is a consequence of (8.72) and (8.75) while time-invariance 
comes from (8.61). Subadditivity in the first argument can be derived as fol- 
lows. By using (8.73) one gets 
AlF, 0 Fa|F3] = S (p[Fi 0 F2 0 F3]) — S (p|Fi © Fa) 
= S (p| Fi fe) F3 D Fal) = S (p| F (0) Foal) : 
Setting p123 = p|Fı o F3 o Fə], it turns out that po = Tri3(pi23) = p[Fs], 


while P12 = Tr3(pi23) = plFi O F] and P23 = Tri (p123) = p| Fe O Fs]. Then, 
strong subadditivity (5.162) yields 


S (p| Fi © F3 0 Fa]) + S (p[Fs]) < S (e[Fi o F3]) + S (P|F2 0 Fs) , 
whence 
AlFi o F2|F3] < S (p[Fi1 © F3]) + S (P|F2 0 F3]) — 25 (p[F3}) 
= ALFF] + A[Fe|Fs] . 
Further, using (8.78) and (8.73), 
AlFi|F2 0 F3] = S (p[Fi © F2 0 F3]) — S (P[F2 0 Fs) 
= A[F 0 F3| F2] — AlF3|Fo] 
< AFi |F] + AlF3|Fo] — A[F3|Fo] = AlFi|F a] . 


Finally, if in (8.78) one puts Fm (see (8.58)) in the place of Fy o Fz and 
F\™ in the place of F3, then using of (8.79), (8.73) and (8.77) one gets 


n—-1 n—1 
AFL? FE] < Y AFFI] < Y AF" Fok] 
k=0 k=0 
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Definition 8.2.4. /10] A *-subalgebra with identity Mo C M is entropy- 
dense in M = LY (X) if for any finite, measurable partition P of X and any 
E > 0 there exists an OPU F C Mo such that A[Zp|F] < £, where Zp E€ M 
denotes the OPU corresponding to P. 


Theorem 8.2.1. Let (LX (X), Or,w,) be the algebraic triple corresponding 
to a classical dynamical system (4,T, u). Let Mo C M = Lie(X) be a O- 
invariant entropy-dense *-subalgebra of M with identity, then 


AFL _ KS 
ho, (Or, Mo) =h" (T). (8.81) 
Proof: Since My C M, Proposition 8.2.2 gives HS (T) > i (Or, Mo). 
Let P be any finite, measurable partition of X, P™® its dynamical refinement 
up to time t = n— 1 and Zp, Ze) the corresponding OPUs in M. Fixe > 0 


and choose F to be an OPU in the entropy-dense Mo such that A[Zp|F] < €; 
then, using (8.75), (8.73), (8.72) and (8.80) one gets 


s(a < 8 e oF) = 8 (oF 0 2) 
=s (a) + i) 2910) + ne 
By dividing by n and taking the limsup, (8.68) obtains 
Hoe Cnet ba FTE h (Or, F) +e < Ho” (Or, Mo) +€ 


for all € > 0. Therefore, i (T,P) < hoe (Or, Mo) for all finite, measurable 
partitions of ¥ whence hi}S (T) < hort (Or, Mo). 


Example 8.2.1. Consider the hyperbolic automorphisms of the torus T? 


studied in Example 2.1.3. To the measure-theoretic triple (T?, T4, dr) one 
associates the algebraic dynamical triple (M,@4,w) where M := LẸ, (T°), 
Oa := Or, and w is the state obtained by integration with respect to dr. 
We now show that the *-subalgebra Mg C M linearly spanned by the expo- 
nential functions en (r) = exp (27 in - r) is entropy-dense in M. 

Given a fixed N € N, the following collection of exponential functions 


en 
Fu = (oh , Iy = {m= (m,m) : -N <ni <N}, 
VM ) nety 


where M := (2N + 1)?, is an OPU ; indeed, 


D 


nEn 


2 
en 


VM 
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where {1 is the identity function on T?. Notice that the functions en are 
orthogonal, namely w(e}, €m) = n,m; thus, (8.60) reads 


plFx] = <7 D len Men! 


nEly 


where {| 2n)}nery is an ONB in C™. Then, S (p|Fn]) = log M which is also 
the von Neumann entropy of the state in (8.65), 


Hl Qu) J=5 Ð len) (enl = 


neIn 


where Py is the orthogonal projection onto the M-dimensional subspace 
spanned by the M orthogonal vectors | en). Also, we have chosen the GNS 
representation where | 92,,) is the identity function on T? and the action of 
Tu| f) on vectors of Lå, (T?) is the multiplication by f € M: (r|mu(f) |W) = 
f(r)W(r) (see Example 5.3.2.2). 

Consider now a finite, measurable partition P = {P;}"1, of T? and the 
corresponding OPU Zp; we want to estimate 


AlZp|Fn] = S (p[Zp ° Fy) — S (o[Frn]) = S (p[Zp ° Fy]) — log M . 


The von Neumann entropy of p[Zp o Fy] is the same as the von Neumann 
entropy of (see (8.67)) 


on = FEporyll Qu Pall = EE E Tolen) Mu) Mas tral xP eR) 


j=l nEn 
1 m 
= 57 Gi Pv Qj, where (r|Q;lb)= xe, (r)y(r) 
j=l 
= ~ l i, = 5 Pn Q; 
= m Tr(Q; Pn Q;j)cj, where oj := Tr(Q; Py Q)) PyQ) 


To compute Tr(Q; Py Qj) we use Example 5.2.3.8; since Q; Py Q; = XÏ X, 
X := PyQ;, its spectrum is the same as that of X X= Py Q; Pn and pie 
their traces are the same. Since 


Tr(Py Q; Pn) = 5 (én |Q; len) 


nEly 


S ar lenf len(r)|? xp; (£) = Mu(P;) , 


n€In 


it follows that oy = J). j=1 H(P;) oj. Furthermore, as the gj are density ma- 
trices with orthogonal ranges, Remark 5.5.5 yields 
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= -One + Yo P) (FRET OD 4 tog a y(P,))) 


1 
=F 2 Tr(n(Q; Pn Q;)) + log M whence 


APS ROG Oe =ar 
j 


We now conclude tgeh proof by showing that, for N large enough, the right 
hand side of the last inequality can be made negligibly small. As already seen, 
Q; Pn Q; and Py Q; Pn have the same spectrum; therefore, 


Tr(n(Q; Pw Q;)) = Tr(n(Pn Q; Pr) - 


Further, n(x) > x(1—2) for all 0 < x < 1 and ņn(x)—x(1—x) is bounded; thus, 
for all € > 0, there exists C(e) > 0 such that [10] n(x) < € + C(e) (1 — 2). 
Applying this inequality to the eigenvalues 7, of Py Q; Py and observing 
that 0 < Tjk < 1 for Py Q; Py < 1, one gets 


7 BCP Po) = pp Dlr) < g DOLE + CO mal =a) 


<e + Of) wae Py) — Tr((Py Q; Pv)?)) 


ml 
=e+C(e) (u(P;) = + 5 (en |Qj Pr Q; len)) i 


neIn 


In the second inequality it has been used that the range of Py has dimen- 
sion M. Since the exponential functions |en ) form an ONB in L4, (T°), by 
increasing N (and thus M) one makes Py — 1 so that 


1 1 
Mu XO (en | Q; Pw Qj len) = u(P;) — Vi X (en |Q;(1— Pv)Q; len) 


néIn nEn 


tends to (Pj) when N — oo. 


AFL Entropy: Finite Quantum Systems 


Like the CNT entropy, also the AFL entropy vanishes for finite-level quantum 
systems. In order to show this we start by deriving a useful bound [10] on the 
von Neumann entropy S (p[Z]) of a given OPU Z = {Z; yl c C B(H), when 
B(H) is equipped with a state represented by a density matrix p. 
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Let : <r; < 1 and |r;) be the eigenvalues and eigenvectors of p and 
{| 2; ne Z| an ONB in C!!; because of Definition 8.2.1, the vectors 


|Z| 
|Z): =o Zel 79) ® | zn) 


are orthogonal, indeed (8.55) yields 


|Z| 


(UF wf) =X (15 | 2 Ze lre) = (ry | re) = bye - 
k=1 


Set pz = Di, ri | UP (UF |; then, S (pz) = S (p) and 


|Z| 
Bı (H) > pr := Tr77(pz) =X X r; Za|r;) (r; |Z! =: F zlo] 
j k=1 
|Z| 
Bi(C!*!) 5 prr := Trr(pz)= XO Tr(o Z} Z;) |) (k| = olZ] . 
j,k=1 


Applying subadditivity 5.161 to these marginal density matrices one obtains 
the following upper bound to S (p|Z]). 


Proposition 8.2.3. Let p € B,(H) be a state on A = B(H) and Z = 
{Z; y2 C B(H) any fixed OPU ; then 


S(p[Z]) < S(p) + S(Falp)) . (8.82) 


en 


If B(H) = Ma(C), then the von Neumann entropy of both p and Fz[p] are 
upperbounded by log d independently of Z. Since in the case of a finite-level 
system the dynamics is implemented by a unitary operator which belongs to 
Ma(C), all OPUs from M;(C) are such that also the refined OPUs Z™) up 
to discrete time t = n — 1 also belong to Mq(C). Then, the following results 
holds. 


Proposition 8.2.4. Let (A,O,,w) be a finite-level quantum system, where 
A= M,(C), w corresponds to a density matrix p € By(C*) and © is imple- 
mented by a unitary U E€ Ma(C). Then, for all OPUs Z C Ma(C), 


hy (0,¥)=0, br (0)=0. 
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Remark 8.2.2. The fact that for finite-level systems the AF entropy van- 
ishes is neither a surprise nor is it the end of the story. Indeed, in quantum 
chaotic phenomena [75, 322], namely when studying the behavior for A — 0 
of quantum systems with a chaotic classical limit, the associated classical 
instability manifests itself in the presence of a logarithmic time-scale as in 
Remark 2.1.3.4. Roughly speaking, the explanation of this fact stems from 
the fact that, in the semi-classical approximation, quantizing means operat- 
ing a coarse graining of the phase-space into atoms of size 27h: this forbids 
the existence of a bona fide Lyapounov exponent, but makes its classical ex- 
istence felt up to times that scale as — log(fi/S) (where fi is normalized to a 
reference classical action $). In some models, as for instance the quantized 
finite Arnold cat map in Example 5.6.1.3 and the kicked top in [184], the clas- 
sical limit can be mimicked by the dimension of the underlying Hilbert space 
N — +o. The AFL construction, notably the entropy of a time-evolving 
OPU , has been applied to such cases and proved to increase linearly with 
the number of timesteps T up to T ~ log N [10, 12, 25]. Interestingly, the 
AFL entropy has also been applied to study the emergence of chaos in the 
continuous limit of discretized classical dynamical systems [24, 26], where the 
suppression of instability also finds its root in a finite coarse graining of the 
phase space. 


AFL Entropy: Quantum Spin Chains 


Interestingly, the AFL entropy of quantum sources differs from the CNT 
entropy by a correction term which increases with the dimension of single site 
algebras. According to Remark 8.2.1, the OPUs will be taken from strictly 
local subalgebras of the quasi-local source algebra A. 


Proposition 8.2.5. Let (Az,w) be a quantum spin chain with single site 
matrix algebras Ma(C). Relative to OPUs from any local subalgebra Ajp,q); 
X C Alp.) E Ao = AR’, the AFL entropy is given by 


ho (Oz) = s(w) + log d , 


where the dynamics is the shift Os over Az, and the translation-invariant 
state wo Os =w has mean von Neumann entropy s(w). 


Proof: Because of translation-invariance of w, it is no restriction to take 
X = {Xi}P1, Xi E€ Apg. It follows that the dynamical refinements a) 
are localized within [0,@+k —1]. With p = wfAjo e441) and Fyw [p], both 
density matrices in M,(C)® +), (8.82) yields 


S(w fA = 
hAFL (O,, X) < lim sup ai a ) 
k= 


logd = s(w) + logd , 
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whence hAF" (6,) < s(w) + log d. i he upper bound is reached by an OPU 
consisting ¥ made of matrix units ef g, := | Po }( qo | from the algebra M,,(C) 
at site 0, where {|7)}“, is an ONB in C” [133]. 


eos 1 
Explicitly, ¥ = { T2 epo qo \ aia 
qo:Po) 


r k—1 
(k) _ J pqa) - Q i 
X = JE l ; Eplk) glk) = Cn: di : 
V (p), qg) i=0 


According to (8.60), the matrix elements of the (d* x d*) x (d* x d!) density 
matrix p[4(")] are thus given by 


1 1 k—1 ; ; 
pl) sane = qe” (ef eft) | = qr” | a 
k-1 
= qk all or ripi W (Be 


The expectations on the right hand side of the last equality define the local 
state pjo,k—1] = w[Ao,n—1) So that 


F 
I 
© i 
k 
>. 

= 

Q 

= 
See 


i 


; 1 
pæ ™®] = gr leh 8 Po,k—-1) : = S (pæ) = S(pp0,n-1)) + klog d 


and hAFL (@,) > hAF4 (6,) X = s(w) + log d. 


AFL Entropy: Price-Powers Shifts 


Price-Powers shifts (see Definition 7.1.14) provide non-commutative contexts 
whereby the differences between CNT and AF'L entropies can be better ap- 
preciated [13]: indeed, it turns out that, while the former depends on the 
bit-stream g, the latter does not. 


Proposition 8.2.6. Let the triplet (Ug, Oo, w) represent a Price-Powers shift 
with bitstream g; then, relative to local OPUs , hAF’ (O,) = 1 independently 
of g. 
Proof: As for quantum spin chains, OPUs will be taken from local sub- 
algebras, ¥ C Ap = utos; by translation-invariance of the state w, we can 
always suppose ¥ = {X;}4_, C Ujo so that X) C Uioe44—1)- The lat- 
ter local subalgebra is not isomorphic to a full-matrix algebra, rather to an 
orthogonal sum of m v; x vj full-matrix algebras: Ujo e+k-1] = Dja M, (©). 
As a linear space Ujo,¢+k-1] is 2‘+* dimensional (this is the number of 
independent W; that generate it), while each of the contributing M,, (C) is 
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a v?-dimensional linear space, whence the constraint 2°t* =)", v?. From 


the splitting of U,¢; the elements Xj (x), t i) = = ioii +- tn—1 E€ ae ) of X(*) 
can be decomposed as Xj(x) = Dye bee and 


m 
P[0,2+k—-1] = B Ôj Tj , 
j=1 


where 7; = il, are tracial states on M,,(C), while 0 < 6;, 7", 6; = 1 
account for the various multiplicities. It follows that 


k k ; ; 
“Ls ips , (p\ CF cap tt) = 75 Cems : 


Then, (8.82), (5.156), (5.155) and concavity of log x yield 


m m 2 
1) < 0555 (o) + log m < $74; los 3 
j=l = 


<log(S\ v7) =l+k, 


j=] 


whence hAF“(6,) < 1. The bound is attained at the OPU consisting of 
orthogonal projectors at site 7 = 0, 


1+ (-) eo i 


X = {pl}, P9}, p?:= 


(k) — Sp. = 77? j 
In fact, X) = {piw = Meerin hoea and 


k—1 
plX™)] 0) 5) = wlw p) = 27” JI Ong = 27E j 
1=0 


The last equality follows by using (7.119) and (7.125); w is tracial and the p$ 
orthogonal for fixed i, thus 


k—1 
w(Ppjæ pir) =w (í Pj, I ita 
k—2 „k—1 „k—2 1 
= Õiojobir- ijr- (o, o Piz Pir abir a * -p ) 
0 k—-2,k-1,k-3 1 
= Gig jo Sin 15x —1 Oin—2n—2Y (e ‘Pik-2Pir—-1Pik-3 vat) + 


k-3 „k—2 k-1 
+ Sinjo Sinan. 7 w (-- "Pix sPjn—2 [ph oe 1 Pi 3 o) (*) 
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Using (7.119) one calculates 


cer 
[pi = Ton J= = rie k—-2tJk—-1 (1 = (—)9)) Cp €h—2 - 


Notice that either this commutator vanishes because g(1) = 1 or the operator 
Pins [ph , aa] (it belongs to the subalgebra Ujk—2,ķ—1]) cannot be turned 
into an identity by means of (7.117) as all the other p come from different 
sites. Thus, (*) vanishes. Iteration of this argument yields 


opps I Sig je W (m pes) = =2 "bij g 


The density matrix p[4“] is thus diagonal with eigenvalues 2~* whence 
S(p[4¥)]) = klog2 and hAF" (@,) > 1. 


Remark 8.2.3. While the AFL entropy is always log2 for all bitstreams, 
instead the CNT entropy varies from 0 to log 2 (see (8.35)— (8.37)). Since the 
bitstream fixes the degree of departure from commutativity, this fact indi- 
cates the CNT entropy is sensitive to the dynamics, but also to the algebraic 
structure of quantum dynamical systems, in particular to whether they are 
asymptotically Abelian. On the contrary, the AFL entropy accounts for the 
effects of the dynamics not directly, rather through a particular family, in 
general not translation-invariant, of local density matrices over a quantum 
spin chain. As such, it is more sensitive to the properties of the state w and 
in some cases strongly depends on the OPUs that are used to construct the 
local density matrices. The effects of the OPUs are at the root of the fact 
that the AFL entropy of a spin chain is the entropy density augmented by the 
logarithm of the dimension of the spin algebras, hê! (O,) = s(w) + log d, 
whereas the CNT entropy equals the entropy density hONT (@,) = s(w). If 
freely chosen and not carefully selected from a suitable O-invariant Ap in such 
a way that the perturbations are kept to a minimum, it may happen that even 
dynamical systems without dynamics may have non-zero AFL entropy. An 
abstract though revealing example is that of a so-called Cuntz algebra [10], 
namely the C* algebra A generated by the identity 1 and by linear combi- 
nations of products W; := Sj,5;,---S;,, of two isometries S;, i = 0,1, such 
that 
Siso=StSi:=1, SoSh+S5,Si=1. 


It turns out that 
Sis, = Si( SpSi + s.st)s, = sis, + sis, —> sis, =0. 


Let the Cuntz algebra A be equipped with the tracial state w(W;) = 0 unless 
W; = 1 in which case w(1) = 1 and take the dynamics as trivial © = id, 
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namely, O[Wi] = W; for all W;. While hSNT (id4) = 0, the AFL entropy 
of the OPU ¥ = {S;/,/2}1_, diverges. Indeed, the elements of the partition 
X) are of the form X;a) := 27*/? Si, ipai °° * 54,3 thus, 


i ik—1 a) 


Xl Xj = FE . ost gt OF, Oye 4 . S85, =2Q-k i) j) whence 


tk—1 tk 


S (pæ) = k log 2 => hAFE (id4, ¥) = log2 . 


Therefore, using Remark 8.2.1.1, one deduces that, if the OPUs are freely 
taken from A, then hA¥" (id 4) = +00. 


AFL Entropy: Arnold Cat Maps 


We now consider the infinite dimensional quantized hyperbolic automor- 
phisms of the torus in Example 7.1.6, namely the triplets (Mo, O4, w), 
where Mg is the von Neumann algebra generated by the Weyl opera- 
tors (7.30), equipped with the automorphism (7.34) and the O,-invariant 
tracial state (7.34). 

We shall show that, independently of the deformation parameter, when 
the OPUs are taken from the O,-invariant » subalgebra Ao generated the by 
Weyl operators Wọ( f) where the f have compact support, the AFL entropy 
of (Mo, O,,w) coincides with the KS entropy [15] (see Proposition 3.1.1). 


Proposition 8.2.7. hôf} (O4) = loga for all (Mo,@,,w), where a > 1 
is the largest eigenvalue of A (see Example 2.1.3) and the OPUs are taken 
from the O,-invariant subalgebra Ao C Mog which is generated by the Weyl 
operators Wo(f) with compact Supp( f). 


Remark 8.2.4. Since the AFL entropy does not depend on @ and because, 
for 0 = 0, the quantum dynamical system (Mg, Oa, w) reduces to the clas- 
sical hyperbolic automorphisms of the torus (see Example 8.2.1), the proof 
which follows is another way to compute i (Ta) and thus the Lyapounov 
exponents. 


The OPUs that will be repeatedly used in the following have the form 


ibi P 
e 
Ž= fz = Wain) } nz EZ . (8.83) 
VP i=1 
: $ Onin; i(Bi—B;) 
Since w(Z; Zi) = ——e'\"~"’, from (8.60) one computes 
p 
£ i(Bi—B;) 
e 
AZ = do laXaleziza= Do els 


tas i3 NiS j 


The index set {1,2,...p} can thus be divided into disjoint equivalence classes 
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{1,2,..- p} = UE , li] := {1 <a < p : Nna = ni}. 
If #[i] denotes the cardinality of [i], one can then write 


AZ] =F = Do el) | a l= Do ual tJ) E | (8.84) 
(i) 


1 
E P abeli 


1 , 
where the vectors | [i] ) := —— X etfi] za), are orthogonal. Thus, 


# li] a€ [i] 
Hl), Hl) Hl 
3 (012) = D Hog ŽI Fal 1 ) (8.85) 


where (2.84) has been used. Consider now two OPUs of the form (8.83), 


z fio Walmer yy z Sa Wn" 
ji P = > 2 VP2 i . 


According to (8.56) and (7.29), their refinement is an OPU of the same form 


etb (ii) (1) (2) P1,p2 
Zio% = Wolni +n; ‘ 8.86 
roZ = | Won +n} Jan (8.86) 


One now introduces the equivalent classes 


li, j] = { (a,b) nP +n?) =n +n?) 1<a<m 1< b< pe} ; 


Notice that if x € [a]; is such that there exist y € [b]2 such that (x,y) € [i, jl, 
then this is true for all pairs (u,v) with u € [a]; and v € [b]2. Therefore, one 
can write 


#(i, i] = 5 Hla]: # [be - 


[a]ı : 3b s.t. [a,b]=[i,7] 


Lemma 8.2.2. The following two properties hold: 


S (p[Z1 © Z2]) = S (p[Z2 © Z1]) (8.87) 
S (p[Z1 © Z2]) > S (plZ2]) . (8.88) 


Proof: The first equality follows from (8.85) applied to (8.86) which gives 


sez = Son (#4) l 


A Ppıp2 
[i.3] 
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The second one can be derived as follows: set 


#lalļı 
p 


N(i, j) := 
[a]ı : 3b s.t. [a,b]=[i,5] 


and use that (xy) = xn(y) + yn(x) > xn(y) to estimate 


1 #lah a) 
N(i, j) PL p2 


S (Zr 0 Za) = So n( NG, 3) 


[i,j] [a]ı :3b s.t. [a,b]=[i,j] 
n #la #(b 
=) Ni j)n > or in 
lij] [a]ı :3bS.t. [a,b]=[i,j] J. Pt P2 


Since 


[a]ı : 3b s.t. [a,b] =[i, j] 


the concavity of n(x) yields 


S (p[Z1 0 Z2]) > > + lah „(#22) = Tan 


(eal (ah Sed elie 
= S (p|Z2)) . 


In fact, by summing over all [7,7], one sums over all [b]2 and [aJ1, the latter 
being as many as pı /#[a]. 


q 
We now concentrate on a special OPU , Z = { at Wo (nj)} where, 


for a fixed q € N, we choose ny := ([a%] — 1)n, 0 < j < q, with Z? > n #0. 
Also, [af] < af denotes the integer part of the j-th power of the eigenvalue 
a > 1. The refined OPU 


Z% = O-z] o Of) [Z] 0---O8[Z] o Z 
has |Z%| = [a7] elements of the form 


e-1 
eG) Wo (nj )) , nj) = Slat] -1A fn, 
k=0 


where j® = joji-.-je-1 with je € I(q) := {0,1,...,[a?~-1] — 1. In order to 
evaluate S (po|Z%*]), we have to investigate the equivalence classes determined 
by relations of the form n(r) = n(s), r,s© © If. By expanding 
n along the (linearly independent) eigenvectors |+) of AT relative to the 
eigenvalues a*!, |n} = y| +) +6|—), one gets 
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£-1 
oA (A) = Pte]) = AK) (TA — [A5] + 
k=0 


£-2 


+ Paena] — [A"4])) = 0, 


k=0 


By choosing q large enough, such an equality can only be true if re_1 = ke_1; 
iterating this argument yields 


n(r) = n(s) 4 r = 8 | 


This implies that the equivalence classes contain one element only, [r] = 
r®, so that (8.85) obtains 


il 
Ls (l2"4)) = $ 10g |Z" = toga’) ; (8.89) 


Lemma 8.2.3. hôf! (O4) > loga. 


Proof: Given the OPU of above, consider the refined OPU 
ZUU-DH — OU Viz) o 0V2] o... Og[Z] 0 Z. 


As already remarked, by refining any pairs of the constituent OPUs one gets 
an OPU of the form (8.83); thus, by repeatedly applying (8.87) and (8.88), 
one gets 

5 (a20) > CE) , 


One finally estimates 


hAFL O,)> li S gza@—1)+1 

o (Oa) 2 me pI (01 J) 

1 1 1 

> —limsup 48S (p121) > -logļa?] , 
q l-++00 4 q 


and the result follows by choosing q arbitrarily large. 


In order to reverse the inequality in the previous lemma, we shall consider 
OPUs of the form Z = {W(f;)}?_1; notice that 


P 


Y wW fol fi) = I> Do A= 1 


i=1 i=l 


as turns out by computing the expectations with respect to w of both sides of 
the operatorial equation and by using (7.35). Let the support of Z be defined 
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as the union of the supports of the constituent f;, Supp(Z) := U?_, Supp( fi), 
where Supp(f) is the set of n € Z? where f(n) 4 0 (see (7.31)). The com- 
pactness assumption means that, given Z there exist finite real constants g, 
d such that Supp(Z) C Rg,a, where 


Roa = {n€ Z? : |n) =qla})+ôla-), l] < g, [ôl <a} , 


with | a4) the eigenvectors of A. 

As seen in Example 7.1.6, the vectors 7.,(Wo(f))| 2.) in the GNS rep- 
resentation, amount to the ((Z2) vectors | fi) = {f(m)}nez2; thus, (8.65) 
gives 


=§ z ERIS ) < log #(Supp(Z)) , (8.90) 


where #(Supp(Z)) denotes the cardinality of the support. 
Since the refined OPU Z has elements 


On [Wal fies] Ox “[Wolfie-2)] = CalWo(far)] WoC fio) » 
and each oF [Wo(fi,;)] is supported by vectors of the form 
AJ|nj) =y a+) + dja74|a_) , 


with nj € Supp(fi), it turns out that Supp(Z) consists of vectors 
e-1 £-1 
= (Sleis)lax) + (Y a8 )la- 
j=0 j=0 
2—1 m é-1 
a] 
4 -= > D ay] s 
j=0 j=0 


whence #(2(")) = O(a”) and, from (8.90), 


lim sup — Ls (p [Z ]) < loga 


n— +00 


for all OPUs from the chosen Ao. 


Lemma 8.2.4. hAF" (O4) = supzey, hF! (O4) Z < loga. 


A 
w 


Finally, Lemma 8.2.3 and Lemma 8.2.4 together prove Proposition 8.2.7 
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8.2.4 AFL Entropy and Quantum Channel Capacities 


In Section 7.3.2 we have discussed the encoding of strings i™ of symbols 
from an alphabet [4 emitted by a classical source by means of quantum 
code-words p(i‘”)) taken from a statistical ensemble with weights p(i”). 
In [8] a different encoding protocol is proposed; it uses a quantum dynamical 
system (A,O,w) and the CPU maps M(Ao) 3 E : At A. The idea is to 
encode strings i) = iiz: in by perturbing the state w with CPU maps 
fj, at each stroke of time t = j, 1 <j <n. 


Remark 8.2.5. As A is in general a quasi-local algebra, in order that the 
encoding protocols be physically implementable, the CPU maps are chosen 
to consist of finitely many Kraus operators taken from the union Ao of all 
strictly local subalgebras, 


A> A= E [A= D> Xlr AXiyr, Xizn € Ao, 
kEI(t;) 


where ker (ig) paneer = land the index set I(i;) is of finite cardinality. 
The CPU maps are further distinguished in E € M,(Ag) when they are 
bistochastic, see Example 6.3.3.1, in which case they are entropy increasing, 
and E € M,(Ao) when the X;,, are unitary: M,(A) C Ma( Ao) C M(Ao). 


In order to proceed with the explicit encoding, it is convenient to pass to 
the GNS triple (Hw, Uw, Rw); set Xi,~ := Tu(Xi;k) and denote by 


E,(B)= >> xX}, BRs, BEB), (8.91) 
kEI(i;) 


the GNS representation of the CPU maps E;, as CPU maps on B(HL,). More- 
over, let Fi, be their dual maps acting on B; (Ho), 


BL (HL) SPOR = dS > Xin PX! ,, (8.92) 
kEI(i;) 


and let Uz![A] := U pU} denote the Schrödinger time-evolution in the GNS 
representation. Then, the encoding procedure proposed in [8] is to assign to 
a string i”) € J, a density matrix Ali) according to the following scheme: 


= (Uz) off,,) o (Uz! of,,_,)o--- (Uz! off, 1.2.) 2a 1] , (8.93) 
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The states p(i") are the GNS representations of perturbed states obtained 
from the given O-invariant state w as follows: 


win = we | [] Ei, 0@ | = wo (Ei, 0) 0 (E; 0@)o--- (Ei, 00). (8.94) 


j=l 


Example 8.2.2. Let the encoding (8.93) be based on a Bernoulli quantum 
spin chain (Az,0,,w), where Az consists of single site algebras Ma(C), Oo 
is the right shift and w the product state 


w(A)=Tr(e@p@--p A), AEC Alpa]: 


p—q+1 times 


Since the Kraus operators from the various CPU maps in (8.93) are finitely 
many and belong to local subalgebras of Az, there exists an £ € N such that 
Xijk E€ At_eg for all i; € I4 and k € I(i;). With respect to (8.94), each 
O = O, shifts the Kraus operators of the CPU map to the right by one site; 
therefore, the Xix of Ei, 2 < j < n, will be shifted to the right by j — 1 
sites. Therefore, the perturbed states w;in) have the form 


(0) 
bi, 0 Bj, i 


1 1 = 
Wiin) = WO O) ig (n ) o ert | 


where the Kraus operators of a D OÎ—!(Xi k), belong to Aj_e4j—1, 045-1): 


J 
It thus turns out that the state wn) in (8.94) amounts to a density matrix 
pe) € Aje,e—n—1) tensorized with p over the sites k ¢ £, -— n — 1]. In 


the GNS representation, the corresponding f(i™) in (8.93) acts as a density 
matrix Oe) on Tul Aje t+n-1]) 8 Tw(Al_e,e4n-1]): 


Example 8.2.3. Given a quantum spin chain as encoder, single-site encod- 
ings turn out to be particularly useful; that is, we will use CPU maps con- 
sisting of Kraus operators Xip belonging to single site algebras Ma(C). The 
following CPU maps are three interesting possibilities. 


1. Let {|2)}4_, denote a fixed ONB in C4; then, consider the purifying maps 
Ma(C) > p = Filo] := Tr(p) |4)(a], i=1,2,...,d. 
These are the dual maps of the CPU maps 


d 
M,(C) > A > E,[B] = [k)i] Ali) k| = (il Ali) 1. 


k=1 
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The encoding is performed by choosing Xix = |7)(k|, i,k = 1,2,...,d, 
whence if Y € A is such that O7-1(Y) e A™M = Ap,-1, then 
wim (Y) = (4 | Rr-1(Y) |s™ ), where |i) := |1) 8 liz) @-+- lin). 
As a consequence, this particular encoding corresponds to a perturbation 
of w such that ples, = i )(i™ | € AM, 

. Consider the discrete Weyl operators in Example 5.4.2 with N = d and 
perform the encoding corresponding to the CPU maps 


Ma(C) 3 A= E,[A] = Wa(n)t AWa(n) , 


with n = (n1,n2), ni = 0,1,...d—1. The Kraus operators involved are 
thus of the form X;, = Wa(n;) where i = 1,2,...,d? enumerates the d? 
pairs n and k = 1 for all i. Then, choosing Y € A as in the previous case, 
the perturbed states turn out to be 


wi (Y) = Tr | Q) Walni,) Wiin) O8-'(¥) |, 


j=1 


corresponding to A™ > PRO = Qj- Walni) pWi(ni,). 

. Let p = ye ri|ri)(r;| be the spectral representation of the single 
site density matrix and | VJ) = X4; Vrj\r;7) & |r) its purification. 
Consider the GNS representation where | 2.,) = (|,/p)(./p|)®™; then, 
the encoding in the previous point yields a perturbed state (8.93) that 
amounts to a local density matrix of the form 


TA @m(AMY > Fo = Q) Walni,)@la| VPN VPI Wi (mi, Ola. 
j=l 


As in Section 6.3.1, the classical source A emitting symbols iMe It with 


probabilities p(i™) is described as a stochastic variable A™® with probability 
distribution ™ = PU hier The encoding (8.93) provides a statistical 


mixture described by the density matrix 


B, (Ho) 3 Pen) = 5 p(i™) p(a™) , 


imer} 


and any decoding POVM B™ = {BO hier by means of operators in 


B(HL.,) 


defines another stochastic variable B®). The mutual information (6.32) of 


A™) and B™ is bounded by the Holevo y quantity (6.33), 


1AM; B®) <8 Gem) — YD w™)S (AE) , 


im) Erm 


(8.95) 
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and depends on the source probability T4), on the CPU maps implementing 
the encoding €(”) and on the POVM BO). 


If the encoding (8.93) is based on Bernoulli quantum spin chains as in 
Example 8.2.2, then 


Pem = 5 pa™) pi) = Fyn | 


MET 


Q)(Q|] , (8.96) 


where, using the argument which led to (8.66), VY” is a localized POVM 
whose elements are operators of the form 


OL (Xaia) (Xinkai) a Xioko € Al—2,04n-1] , 
with Xi x € Aj_¢g. Then, from (8.95), (8.67) and Proposition 8.2.3 
IAW; BO) < S (Bem) = 8 FS ll 22N) = S (D) 
< (n — 24) (S (p) + log d) . (8.97) 


Indeed, p™® [Y] results from the tensor product density matrix p8 -29 on 
the algebra M4(C)® 29., 

Furthermore, if the decoding is operated by means of local POVMs B con- 
sisting of operators B; € Afp], then in (8.95) one can substitute the density 
matrices P(i™) and fein) in the GNS representation with local density matri- 
ces pl°e(i™) and ples = Dio p(i™) p° (i™). The corresponding Holevo’s 
bound reads 


(A; BO) <8 (08%) — SD ve) (G). (898) 


im eI} 
This bound and the fact that the various states are matrices in M4(C)® "728 
imply that, for encodings 6 by means of generic POVMs in A, 
I(A™; B™) < (n— 20) log d, (8.99) 
while, for POVMs B consisting of bistochastic maps 
I(A™; B™) < (n — 26)(log d— S(p)) , (8.100) 


for the encodings are entropy increasing so that S(p!°°(4) > S (p2 ("-29). 


Example 8.2.4. With reference to the three encodings in Example 8.2.3, 
the Holevo x quantity, denoted by x1,2,3 for sake of simplicity, depend only 
on the structure of the perturbed states restricted to the first n sites of the 
quantum spin chain A. By choosing uniform Bernoulli probability distribu- 
tions p(i”) = []j=1 pi; over the indices i, the product structure of the 
perturbed states yields: 
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1. Let m = {pi = 1/d} hay since Pg, = | a |, Sia) = 0 and 


i) 


n oc 1 a 
X plil pe i= (=) xı =nloøgd. 


i) 


2. Let m = {pi = 1/P}Ë4; since ple, = Q? Walni,) oW} (ni), addi- 
tivity of the von Neumann entropy (5.160) and unitarity of the Weyl 
operators imply S$ (Ro) = nS (p). Further, from (5.30) it follows that 


SË, Walni) pW} (n) = dig. Thus 


3. Since oe) = Qj- Wa(ni;) Q la| YP) /p| Wi (ni,) ® l4 are pure, 
S(ARS,) = 0. Also, using again (5.30), it follows that 


d2 


= D Walni)8 la | VP) VPW} (n:)8 la = 18Tr(| VP) vB) = tee, 


where Tr; denotes partial trace over the first factor. Therefore, choosing 
the probability a as in the previous point, 


Cd i Oc 1 $ 
Loe) Aas = (He p) = x% =nllog d+ 5(9)). 


i(n) 


According to Section 3.2.2, the classical capacity of the channel resulting 
from the considered encodings is: 


1 
C= sup lim sup —1(A™; B®) , (8.101) 
n 


a(n) ; Eln) A Bi) n—-0o 


where the supremum is computed varying probabilities, encoding and decod- 
ing protocols. The following possibilities are envisaged: 


1. entanglement assisted capacity, Cont, when B= {Ble Ie C B(H.,) con- 
sists of bounded operators on the GNS Hilbert space; 

2. ordinary capacities, C > Cy > Cu, when B = {B;}ier,; C A and the en- 
coding €(”) is performed with any localized CPU map (C) with localized 
bistochastic CPU maps (C%) and with localized CPU maps consisting of 
unitary Kraus operators (Cu); 

3. Bernoulli capacities, C9, > C? > G? >cC 


ent ui 


when the supremum is taken 
over input probabilities that factorize p(é™) = Pa Bey: 
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Remark 8.2.6. The entanglement in the capacity Cent is due to the GNS 
vector | 2,,) being entangled over the algebra mu (A) ® 7,(A)’ and the con- 
sidered POVMs B consist of generic operators in B(H,,). The entanglement 
of | 2,,) is most simply seen in the case of a Bernoulli quantum spin chain 
as the one discussed in Example 8.2.3; there, the GNS construction amounts 
to purifying the single site density matrix p into a vector | \/p) € C? @ C4 
which entangles Ma(C) ® 1 with 1@ Ma(C) at each single site. 


The entanglement assisted capacity of Bernoulli quantum sources is 
bounded by the AFL entropy, for all triplets (A,O,w), while the capacity 
equals the AFL entropy in the case of Bernoulli quantum spin chains. 


Proposition 8.2.8. The entanglement assisted capacity relative to Bernoulli 
classical sources encoded by using quantum dynamical systems (A,O,w), is 
bounded by the AFL entropy: Coy, < hAF™ (O, Ao). 

Moreover, the capacities of encodings by Bernoulli quantum spin chains 


(A, @>,w,) can be explicitly computed: 


C? = C} = Cu = Cr = log d — S(p) (8.102) 
C° =C = log d (8.103) 
Co, = Cont = S(p) + log d . (8.104) 


Proof: As regards the first part of the proposition, the source probabilities 
pi) = Ma pi, by assumption; therefore, (8.93) and (8.66) yield 


1 
Ber = >> 0) eG) = | [Uso SS pF; | (1 2.) 2 


i™ eI} j=n ijEIa 


= Ug” o Ffm [|o X Rol], = {VPiXix} iste, ; 


Then, from (8.67) and (8.95) 1(A™; B™) < S (Bem ) = S (P™¥]), whence 
the result follows from Definition 8.63. 

Concerning the second part of the proposition, for single site encodings 
of Bernoulli classical sources by means of Bernoulli quantum spin chains, we 
can use the result in Theorem 7.3.4. It ensures that one can always find a 
suitable decoding POVM such that the asymptotic amount of transmitted 
information per symbol, namely the argument of the supremum in (8.101) 
equals the corresponding Holevo y quantity. Consider now the first case in 
Example 8.2.4, x1/n = log d and (8.99) imply that log d < C° < C < log d, 
whence (8.102). In the second case y2/n = log d— S(p), this and (8.99) imply 


log d — S(p) < CP < Cy < log d — S(p) . 
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Thus, (8.103) results from the fact that C? < CP and Cu < Cy. Finally, 
by means of the same argument, (8.104) follows from the third case in the 
quoted example and from (8.97): 


log d+ S(p) = xs < Ch < Cr < log d+ S(p) . 
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The recent book [222] represents a most up to date and complete approach 
to CNT entropy and its applications to C* and von Neumann dynamical 
systems of physical and mathematical origin. It also presents in full detail 
the approach to the CNT entropy developed in [264] and the construction 
of dynamical and topological entropies due to Voiculescu [311]. Another for- 
mulation of a quantum topological entropy, namely of a quantum dynamical 
entropy independent of the given invariant state, can be found in [154]. 

Older books also dealing with the CNT entropy are [22, 226], while a de- 
tailed presentation of the AFL entropy ad its applications is in [10]. The re- 
lations between the CNT entropy and the AFL entropy are reviewed in [155]. 

A different proposal of quantum dynamical entropy based on coherent 
states and suitable to applications to quantum chaos and the semi-classical 
limit is in [282, 281, 74]. Possible applications of quantum dynamical entropies 
to chaotic phenomena in quantum spin chains are discussed in [243]. 


9 Quantum Algorithmic Complexities 


As already emphasized in Chapter 4 information is physical and the limits 
to information processing tasks are ultimately set by the underlying phys- 
ical laws. For instance, the standing models of computation are based on 
the physics of deterministic and/or stochastic classical processes; instead, 
quantum computation theory [66, 128, 224, 165, 71] studies the new pos- 
sibilities offered by a model of computation based on quantum mechanical 
laws. The birth of such a theory finds a technological motivation in the high 
pace at which chip miniaturization proceeds. Indeed, information processing 
at the atomic level, namely at a scale where the physical laws are those of 
quantum mechanics, might soon become a concrete practical issue [66]. On 
the other hand, a strong theoretical impulse to the development of quantum 
computation theory came from Feynman’s suggestion [116, 139] that quan- 
tum computers might provide a more efficient description of quantum systems 
than classical (probabilistic) computers and, above all, from the discovery of 
quantum algorithms with more efficient performances with respect to what 
is classical achievable [224]. 

A first theoretical step in this direction was the extension of the notions 
of TM and of UTM to those of quantum Turing machines (QTMs ) [99] 
and to universal QTMs (UQTMs ) [56]: very roughly speaking, these latter 
are computing devices that work as classical TMs and UTMs , the only 
difference being that their configurations behave as vector states of a suitable 
Hilbert space. Namely, given any set of possible configurations, their linear 
superpositions are also possible configurations. 

Once the existence of UQTMs is foreseen, a very natural theoretical step is 
to try to formulate quantum versions of the concepts introduced in Chapter 4; 
in particular, by extending algorithmic complexity theory to the quantum 
setting, one may try to set up a theory of randomness of individual quantum 
states [120] and of quantum processes. 

In the following, we shall consider some proposals that have recently been 
put forward concerning different ways in which one might approach the al- 
gorithmic complexity of quantum states. All proposals start from the basic 
intuitive idea that complexity should characterize properties of systems that 
are difficult to describe; they can roughly be summarized as follows: 
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1. one may attempt to describe quantum states by means of other quantum 
states that are processed by UQTMs [57]: the corresponding complexity 
will be referred to as qubit quantum complexity and denoted by QC; 

2. one may decide to describe quantum states by classical [309] programs 
run by UQTMs : the corresponding complexity will be denoted by QC, 
and referred to as bit quantum complexity; 

3. one may choose to relate the complexity of a qubit string to the complex- 
ity of the (classical) description of the quantum circuits that construct 
the qubit string [205, 206]. The corresponding complexity will be denoted 
by QCnet and referred to as circuit quantum complexity; 

4. one may extend the notion of universal probability (see Section 4) and 
define a quantum universal semi-density matrix [119]. There then arise 
two possible definitions of quantum complexity, denoted by QCẸp. that 

do not refer either to QTMs or to circuits. 


We shall mainly concentrate on the qubit quantum complexity QCg: it al- 
lows for a quantum generalization of Brudno’s theorem that will be presented 
in detail. On general grounds, one should not expect the above proposals to 
yield equivalent notions; very likely, each one of them will be sensitive to 
different specific quantum properties, as we have seen to be the case with 
the quantum extensions of the KS dynamical entropy. Unlike in the classical 
domain (see Remark 4.3.1.4) where chaoticness and typicalness appear to 
be equivalent characterization of random bit sequences, qubit sequences are 
likely to be random in different inequivalent ways. 


9.1 Effective Quantum Descriptions 


The notions of qubit and bit quantum complexity are based on the use of 
QTMs. In the following, we will not consider what quantum computers might 
do that classical computers do not, nor will we address their practical im- 
plementation (see for instance [128, 224]). We shall simply assume that such 
devices exist and proceed to define: 


the targets of the algorithmic descriptions processed by QTMs ; 
which kinds of algorithms are processed by QTMs ; 

how these algorithms are processed by QTMs ; 

which are the outputs of these processes. 


es T 


1. In the quantum setting, the targets of the effective descriptions will 
be qubit strings; since one is always interested in targets of increasing length, 
a convenient mathematical framework is provided by quantum spin chains 
(see Section 7.1.5), namely by algebraic triples of the form (Az,0,,w) that 
have been introduced in Definition (7.1.11), with 2 x 2 matrix algebras at 
each site. As already noted in Remarks 7.3.1, in going from bit to qubit 
strings there are similarities, but also differences. In particular, there is a 
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larger variety of qubit strings. Therefore, by qubit strings it will be meant 
any local density matrix corresponding to generic mixed and entangled states 
on local subalgebras A™ = (M2(C))®”. 

2. The inputs to QTMs will be generic qubit strings, loosely referred 
to as quantum programs; a subclass of these are the classical programs or 
bit strings that QTMs, as extensions of classical TMs, must also be able to 
process. 

3. While classical TMs ultimately amount to specific transition functions 
between their configurations, QTMs are defined by transition amplitudes 
between their configurations which form a Hilbert space. Any QTM will thus 
identify a specific quantum computation that is a specific unitary operators 
acting on the Hilbert space of its configurations. 

4. Finally, the outputs of a quantum computation operated by a QTM 
will be a qubit string read out by a measurement process. 


Within the framework just outlined, the first two generalizations of clas- 
sical algorithmic complexity previously mentioned are based on qubit strings 
effectively described by bit strings in the first case and by qubit strings in 
the second one. 


9.1.1 Effective Descriptions by qubit Strings 


Given the quasi-local structure of quantum sources as C* algebras generated 
by local n-qubit sub-algebras A“) = M2(C)®*, let us denote by Hy := (C?)®* 
the Hilbert space of k qubits (k € No) and fix in each single qubit Hilbert 
space C? a computational basis |0),|1). In order to be as general as pos- 
sible, superpositions of qubit states of different lengths k are allowed: they 
correspond to vectors in the Fock-like Hilbert space Hp := Bro He. More 
in general, qubit strings will be represented by density matrices p € B} (Hp) 
acting on Hp. 


Example 9.1.1. Any bit string i € {0,1}* identifies a computational basis 
vector in Hp: the empty string À corresponds to the vacuum | 2), the 1- 
qubit subspace H; is spanned by |0), | 1), while the k-qubit subspace H, is 
generated by the vectors corresponding to the bit strings of length k, iM € 
2), namely by |i) ) = | iiz- ig), ij = 0,1. Generic qubit strings amount 
to density matrices in i (H<n) acting on H<, := Dr-o Hg, its dimension 
being Pozo 2% = 2 — 1. 


In the commutative setting, the length of a bit string is simply the number 
of bits it consists of; in the quantum setting, the number of qubits involved 
fixes the Hilbert space dimension. Therefore, the following definition naturally 
extends the notion of length of a program. 
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Definition 9.1.1. The length, ¢(p), of a qubit string p € Bf (Hp) is 
(p) := min{n € No | p € BY (H<p,)} , (9.1) 


setting L(p) = œ if this set is empty. 


As we shall see, QTMs act on and construct superpositions of vector qubit 
strings and, more in general, convex combinations of projection onto vector 
qubit strings of different lengths. Moreover, like their classical counterparts, 
QTMs comprise different parts as a read/write head, a control unit and one or 
more tapes all of them capable of being in states that are either Hilbert space 
vectors or density matrices acting on them. Therefore, the QTMs configura- 
tions too are generically described by density matrices acting on appropriate 
Hilbert spaces. Notice that mixed states are quite typical in such a context 
for they naturally appear when one is interested in the state of the read/write 
head, say, and therefore traces over the Hilbert spaces corresponding to the 
other QTMs components. 

As observed in Remark 7.3.1.3, unlike in the classical situation where there 
are countably many bit strings, there are uncountably many qubit strings 
that can be arbitrarily close to one another. In order to quantify how close 
two qubit strings p,o € BI (Hp) actually are, it is convenient to use the 
trace-distance introduced in Definition 6.3.4, D(p,¢). 


9.1.2 Quantum Turing Machines 


Any model of computation is based on the physics of the processors perform- 
ing the computations; both deterministic and probabilistic Turing machines 
(see Example 4.1.4) work according to the laws of classical physics. It was 
Feynman [116] who was the first one to argue that quantum processes, to be 
efficiently simulated, require quantum computers. Indeed, quantum mechan- 
ics allow superpositions of states; in the case of TMs, the natural classical 


states are their configurations c := (loiiez q, k) (see Definition 4.1.1). The 


main feature of quantum computing machines is the possibility of producing 
and acting on linear superpositions of classical configurations, thus of per- 
forming in one single step of a computation what, classically, would only be 
achieved by an enormous number of TMs working in parallel (this is quantum 
parallelism, a phenomenon briefly sketched in Example 6.1.1). 

The Hilbert space spanned by the classical configurations |c) provides 
(vector) states |W) = X ecc ¥(c)| c) of the QTM, with Fourier coefficients 
W(c) that represent the complex amplitudes associated to the computational 
steps c. As in the case of PTMs, a quantum computation corresponds to a 
level-tree with an initial configuration branching into others, the main dif- 
ference being that the edges leading from one level to the next do not carry 
branching probabilities, rather branching amplitudes that give rise to inter- 
ference effects. 
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Example 9.1.2. With reference to Example 4.1.4 [128], consider the fol- 
lowing branching tree that resembles the scheme of a Mach-Zender inter- 
ferometer (see Figure 5.5). It describes a computational process starting 
off with an initial configuration co that branches into two different con- 
figurations ci; and cı2 at level 1 with amplitudes ao; := a(cocii) and 
ao2 := a(cociz) = 271/2, so that |ao1|? + |ao2|? = 1. This first computational 
step is then followed by a second one with two configurations at level 1 branch- 
ing as follows: c11 into c21 and c22 with amplitudes a11 := a(¢11¢21) = 1/2 
and aj2 := a(ci1¢22) = 1/V2, while c12 into c23 and c24 with amplitudes 
093 := a(c12€23) = —-1//2 and a24 := a(c12C24) = 1/V2. 


Fig. 9.1. Quantum Turing Machines: Level Tree 


Thus, the overall amplitudes for the 4 configurations at step 2 are 


1 1 
a(c21) = 401 441 = 2 ; a(c22) = 401412 = z ; 


1 (ei) 1 
= Cc = SS) 1 og 
2° a\ C24 492424 2 


a(C23) := a92423 = — 


The most important difference with respect to classical PTMs is now ap- 
parent; indeed, consider the case of equal configurations c22 and c23, say 
C22 = C23 = c*. Then, the amplitude for c* is the sum of the amplitudes for 
C22 and c23, a(c*) = 0, whence p(c*) = |a(c*)|? = 0. The corresponding de- 
structive interference eliminates the configuration c* from the computation. 
On the other hand, assume c21 = C24 = cx; these two configurations con- 
structively interfere at level 2 so that a(c.) = 1, whence cą appears among 
the computational steps with probability p(c.) = 1. 


The notion of QTM as a computing device working according to quantum 
mechanics was first proposed by Deutsch{99]. A full and detailed analysis can 
be found in [56] and [3] and further developments in connection with the 
notion of universality in [208]. In the following, we shall assume the existence 
of such machines and provide a schematic presentation of how they perform 
their tasks. QTMs work analogously to classical TMs, that is they consist of 
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1. 


Hy 
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An internal control unit C with associated Hilbert space He linearly 
spanned by the classical control states q;, i = 1,2,...,|Q|, the typical 
control vector being 


IQI IQI 


|Wo) = $5 efi) lai) , dole? = 1. 


i=l 


We shall distinguish special initial and final control states qo and qr, 
respectively. 


. An input/output tape, whose vector states are of the form 


\Yr)= S° tlo), 


ocx 


where o@ € X? denotes any sequence consisting of infinitely many 
blanks and only finitely many symbols from the alphabet X = {0,1} 
(see Section 4.1.1). The basis states |ø } correspond to classical tape- 
configurations and span a (separable) tape Hilbert space Her. 

A read/write head H that can position itself on the tape cells labeled 
by the integers k € Z. The head Hilbert space Hy is formed by square- 
summable sequences and the typical head vector state is 


Yo) =U wA)IE), SO RWP = 1. 


keZ keZ 


A QTM 4 will then be described by means of a Hilbert space of the form 
= Hr ® Hc ® Hy with the configuration basis vectors | a, q, k ) providing 


a distinguished orthonormal basis. 


The time-evolution of standard quantum mechanical systems, that is 


isolated from their environment, is linear and reversible; as any step of a 
quantum computation corresponds, in absence of external noise, to a phys- 
ical quantum process, it must be described by means of a unitary operator 


Uy 


: Ay > Hy. 


Remarks 9.1.1. 


1. 


The probabilistic transition functions (4.4) are replaced by a quantum 
transition function which assigns amplitudes (not probabilities) to the 
transitions (q,0) + (q',o',d): 


(q,0;4',0',d) + 5(g,0;q',0',d) € CO , (9.2) 


where Č denotes the set of complex numbers a € C, 0 < |a| < 1, such that 
there is a deterministic algorithm that computes the real and imaginary 
parts of a to within any fixed precision 27” in time polynomial in n. 
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Consider the linear operator Uy on Hy whose matrix elements with re- 
spect to the configuration basis vectors are defined by [229] 


(d',o, k'| Us |q, 0, k) = eo R if k =k-1 


daoni onti it E=ktig ©? 


where d = +1 identify a head’s movement to the left d = —1), respectively 
to the right (d = +1), and the tape (classical) configurations ø and o’ 
are such that their symbols øo; = o’ for all 7 # k: 

In this way the quantum transition function 6 identifies the possible tran- 
sitions operated by the linear operator Uy; namely, Uy operates a tran- 
sition 


tape conf. : o tape conf. a! 
cell with head on it : k > +> cell with head on it: k + d 
symbol in cell k : ok symbol in cell k : o}, 


if and only if ô(q, ok, q', op, d) # 0. 
Using the orthogonality of the configuration vectors, one explicitly com- 
putes the action of Uy as 


Uy|q,0,k) = Xo óld, Oki d, 0%, +d) Id,ok j +d) ? (9.4) 


qd Opd 


where øg', denotes the tape-configuration with all symbols equal to those 
of ø, but for the k-th one. In [229] necessary and sufficient conditions are 
given on the quantum transition function 6 so that Uy acts unitarily on 
Hy and thus appropriately describes a quantum computation as a unitary 
discrete-time quantum evolution. 

. A possible model of a QTM is obtained via a quantum circuit consisting 
of unitary gates (see Example 5.5.9), a so-called circuit model. A quan- 
tum computation on, say, N qubits thus amounts to a unitary operator 
U acting on a 2% dimensional Hilbert space Hy. It requires a certain 
number of gates to be implemented; if one had at disposal all 1-qubit 
unitary gates plus the CNOT 2-qubit gate, then any U would be exactly 
implementable [165]. In particular, the action of U : Hy |> Hy ona 
given state |Y) requires O(2") of these gates to be implemented [206]. 
More constructively, one seeks finite sets of gates G that would provide a 
so-called complete gate basis in the sense that the action of any 1-qubit 
gate can be mimicked by gates from G up to an arbitrary precision: one 
such set consists [165] of the CNOT gate, the Hadamard gate and the 


1-qubit gate 
—in/8 0 
e 
T := ( 0 ns | d 


Consider a generic unitary action U : Hy + Hy of a quantum circuit 
consisting of m 1-qubit and CNOT gates; a result known as Solovay- 
Kitaev theorem |170, 306, 165, 206] states that that U can be reproduced 
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up to any £ > 0 by O(mlog® m/e) gates from G with c € [1,2]. Then, on 
a given |y) € Hy, ||((U—V)|wv)|| < £ where V : Hy + Hy is a unitary 
operator corresponding to a quantum circuit consisting of N(U, £) gates 
from G, where [206], 


N(U,e)=O (2" log® (=)) 


3. Another model of quantum computation is based on the possibility of 
implementing unitary operations on qubits by means of the mechanism 
outlined in Example 6.1.5. This latter is at the root of the so-called one- 
way quantum computation [249, 323], whereby quantum gates, that is uni- 
tary transformations, on qubits are implemented by performing measure- 
ments, that is irreversible operations, on some other qubits, all of them 
prepared in certain entangled multipartite states called cluster states. 


Definition 9.1.2 (QTMs: Starting and Evolution Conventions). 
Given a UQTM 4 and an input qubit string o € BY (Hp), we shall identify 
it with the initial state of a quantum computation by U again denoted by o. It 
corresponds to a density matrix acting on Hy with o written on the input track 
over the cells indexed by [0, l(c) — 1], and blank states # on the remaining 
cells of the input track and on the whole output track, while the control is in 
the distinguished initial state qo and the head is in the state corresponding 
to its being positioned upon the 0 cell. The state U (0o) of U on input o after 


t € No computational steps will be given by U'(a) := Ua (U4). 


In the rest of this section we shall deal with the halting conditions for 
QTMs and with showing that their actions amount to definite quantum op- 
erations, that is to trace-preserving completely positive maps on B} (Hp). 
For this observe that, in accordance to the previous definition, the state of 
the control after t steps is given by partial trace over all the other parts of 
the machine, that is over the head and tape Hilbert spaces, Hc and Hr, 
respectively, UG(o) := Try,r (U'(c)). 


Definition 9.1.3 (QTMs: Halting Convention). A QTM 4 halts at time 
t € No, that is after t computational steps, on input o € i (Hr), if 


(as |Mo(o) lay) =1 and (af |MG(e) lap) =0 for every t <t, (9.5) 


where qr is a special control state. 
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Remark 9.1.2. The above halting convention expresses the possibility of 
checking whether a QTM halts on a certain input by measuring the orthog- 
onal projection | qf )(q,|: if U has halted on input ø then, by measuring 
lar)(af|, one ascertains this fact with certainty. On the other hand, if U 
has not yet halted then measuring | qs )(qy| has no effect on the still go- 
ing on computation. In general, for a generic input o = |y yy | € BY (Hp), 
0 < (a¢|LE(|e)(v]) las) < 1; in such a case the vector |y) will be called 
non-halting, otherwise t-halting. Let H(t) C Hp denote the set of vector in- 
puts with equal halting time t: their linear combinations are also inputs such 
that 4 halts on them at time t. Therefore, H(t) is a linear subspace of Hp; 
what is more important, if t 4 t’, the corresponding subspaces H(t) and H(t’) 
are mutually orthogonal. Indeed, were this not true, non-orthogonal vectors 
could be perfectly distinguished by means of their different halting times. It 


follows that the subset B7 (Hr) on which 4 halts is the union U,<,- BY (H(¢)). 


It proves convenient to consider a special class of QTMs with the property 
that their tape T consists of two different tracks, an input track I and an 
output track O. This can be achieved by having an alphabet which is a 
Cartesian product of two alphabets, in our case X = {0,1,#} x {0,1,#}. 
Then, the tape Hilbert space Hr can be written as Hr = Hy ® Ho. 


Definition 9.1.4 (Quantum Turing Machines). 
A map U : B} (Hr) —> Bf (Hp) will be called a QTM, if there is a two- 
track QTM 4 with the following properties [56]: 


1. the alphabet consists of X = {0,1,#} x {0,1,#}; 

2. the corresponding time evolution operator Uy is unitary,; 

3. if U halts on input o with a variable-length qubit string p € BY (Hr) on 
the output track starting in cell 0 such that the i-th cell is empty for every 
i £ [0, 2(p) — 1], then U(o) = p; otherwise, U (o) is undefined. 


In general, different inputs ø have different halting times t(o) and the 
corresponding outputs result from different unitary transformations ue, 
However, notice that the subset of Bt (Hy) on which U is defined is of the 
form Uen BY (H(®). Therefore, by introducing an internal clock that keeps 
track of the halting times, the action of U restricted to this subset amounts 


to a well-defined quantum operation, that is to a completely positive map 
U: B} (Hr) — Bf (Hr). 


Lemma 9.1.1 (QTMs as Quantum Operations). 
For every QTM 4 there is a quantum operation U : B} (Hr) — BI (Hr), 
such that U(o) = Ufo] for every o € Uren BY (H(t). 
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Proof: Let B, be an orthonormal basis of H(t), t € N, and By, an or- 
thonormal basis in the orthogonal complement of ,¢xj H(t) within Hp. Let 
an ancilla Hilbert space Ha := (No) be added to the QTM, and define a lin- 
ear operator Vy: Hr —> Hy SHa by specifying its action on the orthonormal 
basis vectors Uren{B;} U BL: 


__ J Uglb) @|t) if |b) € B, 
Val) =$ |b) @|O) ifb) EBL. 


The ancilla acts as a sort of internal clock which registers the halting times of 


the components of a vectors belonging to the halting subspaces and assigns 
time 0 to the non-halting components. With B, = {| bi, )}, B+ = {| by )}: 


Hy @Ha 5|%) = >> J Clos, ) + 2 r0) ) oF), 


t=0 Je 
Valy) =X) Ch) UE OG, ) lt) + X CE) |b; ) @[0) . 
t=0 je J 


From orthogonality, it turns out that the map Vy is a partial isometry: 
(V |Vi Va|) =(V|8), VU, SecHy@Ha. 


Thus, the map o +> Vuo Vi is trace-preserving and completely positive (see 
Section 5.2.2). Further, by partial tracing over the Hilbert spaces of the head, 
of the control unit, of the input tape and of the internal clock Hilbert spaces, 
one obtains the quantum operation Uf|o] := Tro,n,1,a (Vao Vg). 


We have seen in Chapter 4 that the definition of algorithmic complexity 
rests on a solid ground because the length of the shortest effective description 
of a bit string is essentially independent of the computer that computes it 
once this is chosen from the class of universal Turing machines. Clearly, any 
definition of quantum complexity based on using QTMs will also need the 
existence of universal QTMs in order to be essentially machine-independent. 

n [56], a UQTM 4U was constructed that works as follows: for any QTM 2 
there exists a classical description (bit string) iq of 2% such that 


D (Uia, T, YNYS), ADLN) <4 


for all inputs |Y) y| € BY (Hr), computational steps T and 6 > 0 with 
D(-,-) the trace-distance introduced in Definition 6.3.4. 

According to this definition {M is universal in that it simulates any other 
QTM up to an arbitrary accuracy for a given number of steps; notice that 
this latter piece of information must be part of the input. This means that, 
if A halts on a certain input, M is able to approximate the output of XY only 
if provided with the halting time. While such a definition works perfectly 
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well for the aims of [56] which are directed to see the impact of QTMs on 
computational complexity (see Remark 4.1.2), it is on the other hand not 
appropriate for an approach to quantum algorithmic complexity simply be- 
cause the halting times are likely to be enormous and in any case cannot be 
given beforehand. 

A useful definition of a UQTM Y for algorithmic purposes must then 
be independent from the halting time of the simulated QTM 2. The main 
problem is that, as the simulation is only approximate, such is in particular 
the simulation of the control state of A whence, when 2 halts, 4 will in general 
do it only with a certain probability thus violating the halting convention in 
Definition 9.1.3. In [208] it is showed how such a problem can be circumvented 
and how one can arrive at the following operative definitions of UQTM which 
is fully consistent from the point of view of quantum algorithmic complexity. 


Theorem 9.1.1 (Strongly UQTMs). [208] There is a QTM Y such that 
for every QTM % and every qubit string o for which A(o) is defined, there 
is a qubit string oq such that 


D (Ud, ox), U(o)) < å V5 E Qt, 


where (see Definition 9.1.1) Koy) < Lo) + Cm, D(-.-) is the trace-distance 
and Cy E N is a constant depending only on 2. 


In the following theorem, both the universal simulator, 4, and the QTM 
to be simulated, 2, are provided with a quantum input and a classical input 
fixing the accuracy 6 of the approximation. 


Theorem 9.1.2 (Parameter Strongly UQTMs). [208] There is a UQTM 
U with the properties of the previous theorem such that for every QTM & and 
every qubit string o, there is a qubit string oq such that, if A(2k,c) is 
defined, then 


1 
D (U(k, ox) ’ 2(2k,o)) x Jk Vk eN , 


and L(ox) < Lo) + Cy, Cy E€ N depending only on A. 


This result is not just a corollary of the preceding theorem: indeed, ac- 
cording to Theorem 9.1.1, the input, oq, may in general depend on k. 


Remark 9.1.3. A UQTM is able to apply a unitary transformation U on 
some segment of its tape within an accuracy of 6, if it is supplied with a 
complex matrix U as input such that 


494 9 Quantum Algorithmic Complexities 


~ ô 

0-8 < a 

d being the size of the matrix. The machine cannot apply U exactly; in fact, it 
only knows an approximation U. It also cannot apply U directly, for U is only 
approximately unitary, and the machine can only work unitarily. Instead, it 
will effectively apply another unitary transformation V which is close to U 
and thus close to U, such that ||V — U|| < 6. Let |Y) := U|wo) be the output 
that one wants to have from 4 and let |¢) := V|wWo) be the approximation that 
is really computed by the machine. Then, both the norm and trace-distance 


are small: |||Ø) ~|¥)I < ô D(|d)(ol1¥)(vl) <6. 


9.2 qubit Quantum Complexity 


As already remarked, unlike bit strings, qubit strings, are uncountably many 
and cannot be expected to be exactly reproducible by a QTM . It rather 
makes sense to try to approximate a target qubit string p by a qubit string p 
within a trace-distance 0 < D(p, p) < 1 (Ø ~ p). According to the previous 
section, p will be the output of a QTM Y that executes a quantum program 
o € BY (Hp): p:= Ufo] & p. 


Remark 9.2.1. In view of the definition of classical algorithmic complexity, 
one is particularly interested to seek whether the length of o (see (9.1)) can 
be made shorter than that of p itself: (o) < (p). The minimum possible 
length (ø) for reproducing p will get us close to the notion of qubit quantum 
complexity QC,. There are at least two natural possible definitions. The first 
one is to demand only optimal (in the sense of minimal length) approximate 
reproductions of p within some trace distance ô. The second one is based 
on the notion of an approximation scheme. In order to define the latter, the 
chosen QTM has to be supplied with two inputs, the qubit string and a 
parameter. 


Definition 9.2.1. Let k € N and o € BY (Hp). Let (k) denote the string 
that consists of the at most |log, k| bits of the binary expansion of k, each re- 
peated twice and ends with 01. Let | 3(k))( G(k)| be the corresponding projec- 
tor in the computational basis. The map (k,a) => C(k, 7) := | B(k))( B(k) |80 
defines an encoding C : N x B} (Hr) —> Bi (Hr) of a the pair (k,c) into a 
single qubit string C(k,a). Note that 


e(C(k,o)) = 2|logk| +2 + (0) . (9.6) 


We shall denote by U(k,o) the result of the action of a QTM 4 on C(k, co). 
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The above encoding has the typical self-delimiting form that we have 
already met in Example 4.1.5. In this way, the QTM {Y is able to detach in 
C(k,@) the information about k from that about ø. 


Definition 9.2.2 (qubit Quantum Complexity). 

Let be a QTM and p € B} (Hr) a qubit string. For every 5 > 0, the 
finite-accuracy quantum complexity QC? (p) is defined as the minimal length 
(a) of any quantum program o € BY (Hp) such that the corresponding output 
Uc) has a trace-distance from p smaller than 6, 


QC? (p) := min{ (o) : D (p, U(0)) < ò} ; (9.7) 


Similarly, an approximation-scheme quantum complexity QCy is defined as 
the minimal length €(c) of any density operator o € B} (Hp), such that when 
processed by U together with any integer k, the output U(k,o) has trace- 
distance from p smaller than 1/k, for all k: 


QCy(p) := min{ &(c) : D(p, Wk,o)) < L for every k € N} (9.8) 

We now show that theorems 9.1.1 and 9.1.2 allow one to prove the in- 
dependence (up to an additive constant) of the above definitions from the 
chosen QTM 4 if this is universal as specified in those theorems. Accord- 
ingly, we will fix an arbitrary UQTM and, like in the classical case, drop 
reference to it and set 


QC, (P) := QCu(p) , QC? (p) := QCA (Cp) . (9.9) 


Theorem 9.2.1. There is a QTM 4 such that for every QTM % there exist 
constants Cy > 0 and Casa such that for every qubit string p € BI (Hr) 
and 0 < ô < A, it holds that 


QCy(p) < QCy(p) + Ca, QC) < QCK(p) + Casa - 


Proof: Let l = QCÌ (p), then there exists o such that, according to (9.7), 
L= (0) and D({o], p) < 6. On the other hand, Theorem 9.1.1 implies that 
there exists a QTM Y and a density matrix oq such that 

D(U(A = 8,02), 2(0)) < A- 
whence, by the triangle inequality, 


D(u(A- 8,0), p) <A. 
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Moreover, (a4) < Lo) + Ca = QCÀ (p) + Cy; thus, with C(A — 8, ox) as 
in Definition 9.2.1, using (9.6) it follows that 


e(C(A —6,0u)) < (ox) + Cs,a < QCA(p) + Casa - 


whence QCi(p) < QCx(p) + Cas,- 

If € = QCy(p), then there exists a qubit string o such that £ = 
L(o) and D(Q(k,o),p) < 1/k for all k € N. On the other hand, Theo- 
rem 9.1.2 says that there exists a QTM MM and a density matrix oq such 


that D((k, oa) , A(2k, )) < m It follows that 
D(U(k, ox) , p) < D(U(k, oa) , A(2k,0)) J D(2(2k, 0) p) 


Together with the fact that Loy) < Ko) + Ca < QCy(p) + Ca, this implies 
QCy(p) < QCa lo) + Ca. 


Remarks 9.2.2. 


1. Definition 9.2.2 is essentially equivalent to that in [57], the only technical 
difference being the use of the trace distance rather than the fidelity. 

2. The same qubit program o is accompanied by a classical specification of 
an integer k, which tells the program to what accuracy the computation of 
the output state must be accomplished. Notice that in (9.8) the minimal 
length has to be sought among those ø such that anyone of them yields 
an approximation of p within 1/k for all k: this is an effective procedure. 

3. The exact choice of the accuracy 1/k is not important; choosing any 
computable function that tends to zero for k — oo will get an equivalent 
definition (in the sense of being equal up to some constant). The same 
is true for the choice of the encoding C: as long as k and o can both be 
computably decoded from C(k, a) and as long as there is no way to extract 
additional information on the desired output p from the k-description 
part of C(k,o), the results will be equivalent up to a suitable constant. 


Examples 9.2.1. 


1. If U is a UQTM, a noiseless transmission channel (implementing the 
identity transformation) between the input and output tracks can always 
be realized: this corresponds to classical literal transcription, so that au- 
tomatically QC%(p) < &(p) + cu for some constant cy. Of course, the 
key point in classical as well as in quantum algorithmic complexity is 
that there sometimes exist much shorter qubit programs than just literal 
transcription. 
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2. The finite accuracy and approximation scheme QC, are related to each 
other by the following inequality: for every QTM 4 and every k € N, 


QCl/* (p) < QC, (P) +2llogk] +2, Vp € BY (Hp) . 


Indeed, if QCy(p) = £, there is o € Bf (Hp) with (o) = @, such that 
D(U|k, oJ, p) < 1/k for every k € N. Then o’ := C(k,o), where C is the 
encoding in Definition 9.2.1, is such that D(U[o’], p) < 1/k and 


qc (p) < Ko’) < 2|logk| +24+ £= 2|logk| +2+ QC, (Pe) 4 


where the second equality follows from (9.6). 


9.2.1 Quantum Brudno’s Theorem 


In this section, we prove a quantum version of Brudno’s theorem (Theo- 
rem 4.2.1), by means of which we shall connect the quantum entropy rate 
s of an ergodic quantum spin chain to the qubit complexities QC, (e) and 
Qc? (p) of qubit strings that are pure states p = |w)(w| of the chain. It 
will be showed that there are sequences of typical subspaces of (C?)®", such 


1 1 

that the complexity rates —QC, (q) and =QCÅ (q) of any one-dimensional 
n n 

projector q onto a state belonging to these subspaces can be made arbitrarily 


close to the entropy rate by choosing n large enough. Moreover, there are no 
such sequences with a smaller expected complexity rate. 


Theorem 9.2.2 (Quantum Brudno’s Theorem). 

Let (A,w) be an ergodic quantum source with entropy rate s. For every 
6 > 0, there exists a sequence of w-typical projectors qn (8) € A™, n € N, 
ie. limno Tr(p gn(0)) = 1, such that for every one-dimensional projector 
q < qh) and n large enough 


“Qc, (q) E€ (s —6,8+5) , (9.10) 
Loo? (q) € (s —6(2 + ô)s,s +ô) . (9.11) 


Moreover, s is the optimal expected asymptotic complexity rate, in the sense 
that every sequence of projectors qn E€ A™, n € N, that for large n may 
be represented as a sum of mutually orthogonal one-dimensional projectors 
that all violate the lower bounds in (9.10) and (9.11) for some 6 > 0, has an 
asymptotically vanishing expectation value with respect to w. 


As for the proof of Brudno’s theorem 9.2.2, we first prove upper and then 
lower bounds [40]. 


498 9 Quantum Algorithmic Complexities 
Lower Bounds 


In the classical case, it has been showed that there cannot be more than 
2°+1 _ 1 different programs of length / < c and this fact has been used to 
prove the lower bound to complexity in Brudno’s Theorem. 

A similar result holds for QTMs , too. In order to show this, one can 
adapt an argument due to [57] which states that there cannot be more than 
2+! _ 1 mutually orthogonal one-dimensional projectors p with quantum 
complexity QC, (p) < £. The proof is based on the Holevo’s y-quantity (see 
Proposition 6.3.3); we shall use it to provide an explicit upper bound on 
the maximal number of orthogonal one-dimensional projectors that can be 
approximated within trace-distance 6 by the action of completely positive 
maps E on density matrices o of length @(c) < c. 


Lemma 9.2.1 (Quantum Counting Argument). 

Let0 < < 1/e, cE N such that c> = 3 (4+ 2log +), K a linear subspace 
of an arbitrary Hilbert space K, and E : B} (Hr) — BY (K) a quantum oper- 
ation. Let N be a maximum an pulsed of orthonormal vectors from 
the set A9(E,K) of all normalized vectors in K which are reproduced within 


ô by the operation E on some input of length < c: 


AR(E, K) = {|6) €K: og € BY (He), D Elosl,|4)(41) <5} , 


4 
Th log, |N®| < 1+ —— de. 
en, loga|No|<e+ va se 


Proof: Let ¢; € A%(E,K), j =1,...,N, a set of orthonormal vectors and 
VY denote the Abelian subalgebra of B(IK) generated by the corresponding 
projectors P; := |ġ;)(ġ;| and Pyyi := 1K — . P;. By the definition of 
AS(E, K), for every 1 <i < N, there are density matrices g; acting on H<, 


with D(E[o;], P;) < ô. 
N 


1 
Let g := PRE it also acts on H<. and dimH<, = 2°! — 1, 


i=1 
whence (6.33) yields y(E,) < c+1, where Es := {o,0;/N}. Then, consider the 
completely positive map Ey : B} (IK) > B}(K), p > Eyle] := YAT PeP,. 
Applying twice the monotonicity of the relative entropy under completely 
positive maps, 


te La 
VA S (Ey cE[o;], Ey oE[o]) < oD S (E[o;] , E[o]) 


For every i € {1,..., N}, the density matrix Ey o E[o;] is close to the corre- 
sponding one-dimensional projector Ey[P;] = P;. Indeed, (6.68) yields 
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D(Ey o Efo;], Ey[Pi]) < D(E[oi], Pi) <6. 


Let A := a P;. The trace-distance is jointly convex (see (6.69)), 


D(Ey o Ejo], A) < ODL zy o Eloi]), P) < ô. 


Since 5 < 4, Fannes inequality (5.157) gives 


S (Ey o Efø:]) = |S (Ey o Eloi) — $(P,)| < ôloga(N + 1) + (8) 


S (Ey o E[o]) — S(A)| < dlog(N +1) + 7(6) , 


where (0) := —d log, 6. Combining the previous estimates yields 


c+1>x(&) > (1 — 26) logy N — 26 — 2n(6) . 


Ifloga N > c+1+ 23 be, then c+1 > c+1+06(cd—4)+26 log 6, whence c < 
2 (2+ log +). Therefore, the maximum number |NÊ| of orthonormal vectors 


246 
in AÎ(E, K) must fulfil log, |N] <e+1+4+ T 


The second step uses the previous lemma together with Proposition 7.3.1 
about the minimum dimension of the typical subspaces. Notice that the limit 
(7.153) is valid for all 0 < € < 1. By means of this property, one proves 
the lower bound for the finite-accuracy complexity qc? (p), and then use 
Example 9.2.1.2 to extend it to QC, (p). 


1 
Corollary 9.2.1 (Lower Bound for =QCÅ (p)). 
n 


Let (Az,w) be an ergodic quantum source with entropy rate s. Further, let 
0<4d<1/e, and let (pn) nen be a sequence of typical projectors, according to 
Definition 7.3.2. Then, there is another sequence of typical projectors pn(d) < 
Pn, such that for n large enough 


1 
-QC (P) > s — (2 + ô)s 
is true for every one-dimensional projector p < Pn(ô). 


Proof: The case s = 0 is trivial, so let s > 0. Fix n € N, 0 < ô < 1/e and 
consider the set 


An (5) = {p < pn : P=|b 41, QCP) < ns(1 — (2 + 4))} 


From the definition of Qc? (p), for any of such p’s there exists a density 
matrix op with (op) < ns(1 — 6(2+ 6)) such that D(U(op), p) < 6, where, 
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as explained in Lemma 9.1.1, U(o,) is the result of the quantum operation 
U: Bt (Hr) — By (Hp) associated with the UQTM 4 that has been fixed as 
explained before Theorem 9.1.1. Then, using the notation of Lemma 9.2.1, 
An(6) C Ati 5(2-+6))] (U, Kn), where K,, is the typical subspace supporting 
Pn. Let pn(ô) < pn be the sum of any maximal number of mutually orthogonal 
projectors from AP no(1—6(248))] (U, Kn). If n is such that 


ns(1 — 6(2+6)) > L € | 21oga3 ) , 


Lemma 9.2.1 implies that 


ba ep) = st =f Sy 14 2T ainsa 5(2+6))] . (9.12) 


Therefore, no one-dimensional projectors p < pn(d)+ := pn — pn(d) exist 
such that p € AP six 5(2+8))] (U, Kn). Namely, one-dimensional projectors 


P < Pn(d)+ must satisfy 


LOCE (p) > s—4(2 + ô)s . 
n 


Since inequality (9.12)) is valid for every n € N large enough, 


1 554 
lim sup — logy Trnpn(d) < s — 26°s — 5 <s. (9.13) 
n 


n—00 1— 26 


From Proposition 7.3.1 limn—oo Tr(p™ pn(d)) = 0, whence Pn(d) := pn(d)+ 
provide the required sequence of typical projectors. 


1 
Corollary 9.2.2 (Lower Bound for —QC, (p)). 
n 


Let (Az,w) be an ergodic quantum source with entropy rate s. Let (Pn) nen 

with pn € A™ be an arbitrary sequence of typical projectors. Then, for every 

0 < ô< 1/e, there is a sequence of typical projectors Py(d) < pn such that, 

for n large enough, —QC, (p) > s —6 is satisfied for every one-dimensional 
n 


projector p < Pô). 
Proof: From Corollary 9.2.1, for every k € N, there exists a sequence of 
typical projectors Pn(Z) < pn, such that, if n is large enough, 


M 1o 1 
QC (p) >s Ctp 


for every one-dimensional projector p < pn(1/k). Then 
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: 1 : 2+2|log,k 
1 Qc, (p) > LQct* (p) — 2+ less Al 
n n z 


1 1 2(2 + log k 
>83 (2+7)s ae) 


k n 


where the first estimate is by Example 9.2.1.2 and the second one is true for 

one-dimensional projectors p < pn(¢) and n € N large enough. Fix a large k 
1 1 ô = 1 

satisfying Ê +—)s< 5 The result follows by setting p,(5) = Polg) with 


k 
k, and n such that 


1. 1. 8  2(2+log,k) 
2 


ô 
< ; 
n =D 


Upper Bounds 


The lower bound shows that, for large n, with high probability the qubit 
complexity of pure states of a quantum spin chain is bounded from below 
by a quantity which is close to the entropy rate of the chain. Similar upper 
bounds also hold from which Theorem 9.2.2 follows. 


Proposition 9.2.1 (Upper Bound). 

Let (Az,w) be an ergodic quantum source with entropy rate s. Then, for 
every 0 <6 < 1/e, there is a sequence of typical projectors pn(d) € A™ such 
that for every one-dimensional projector p < p,(d) and n large enough 


1 1 
—QC, (p)<st+6 and —Qc? (p)<s+o. 
n n 


The proof of this statement is obtained by explicitly providing, for any 
minimal projector p < p,(d) € A™, a qubit string P of length ¢(p) ~ n(s+6), 
that computes p with arbitrary accuracy. Such a qubit string is constructed 
by means of universal quantum typical subspaces introduced (see Defini- 
tion 7.3.3); its length is in general not minimal and only upperbounds the 
quantum complexities qc? (p) and QC, (p). However, it is shorter than the 
literal transcription of p (see Example 9.2.1.2): recall that the latter corre- 
sponds to a qubit string p comprising p itself plus the instructions to the 
UQTM UL to copy p, whence its length (p) ~ n > n(s +8) for n large enough 
and 6 sufficiently small. 

Let 0 < e < 6/2 be an arbitrary real number such that r := s + e€ is 
rational, and let {pn := QP nen be the universal projector sequence of 
Theorem 7.3.3, which is independent of the given state w as long as s(w) < s. 

Though the dimension of the subspace supporting Pn is ~ 2”, generic 
one dimensional projections q < Pn are not qubit strings as defined in 
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Section 9.1.1 and their lengths need not be (q) ~ nr. However, because 
of (7.154), if n is large enough then there exists some unitary transformation 
Ut that transforms the projector py, into a projector belonging to the state- 
space B? (H [nr]); Where [nr] is the smallest integer larger then nr. It follows 
that every one-dimensional projector p < Pn can be transformed into a qubit 
string p := UtpU of length ¢(p) = [nr]. 

According to Remark 9.1.3, p can be presented to the UQTM 4U together 
with some classical instructions including a subprogram for the computation 
of the necessary unitary rotation U. This UQTM starts by computing a 
classical description of the transformation U, and subsequently applies U to 
p, recovering the original projector p = U pU? on the output tape. 

Apart from technical details, the main point in the proof is the following: 
since the unitary operator U depends on w only through the entropy rate s, 
the subprogram that computes U does not have to be supplied with addi- 
tional information on w and its restriction to A(). Therefore, the additional 
instruction for the implementation of U will contribute with a number of 
extra qubits which is independent of the universal projection index n. 

The quantum decompression algorithm Ð will formally amount to a map- 
ping (r is rational) 


D:NxNxQxHp -— Hr, (k,n,r, p) => p= D(k,n,1,p) . 


Remark 9.2.3. The decompression algorithm D is due to be short in the 
sense of being ”short in description”, not short (fast) in running time or 
resource consumption. Indeed, the algorithm ® is in general slow and memory 
consuming; however, this does not matter. In fact, algorithmic complexity 
only cares about the length of the programs and not either in how fast they 
are computed or in how much resources they consume. 


In the following steps, D will deal with rational numbers, square roots of 
rational numbers, bit-approximations (up to some specified accuracy) of real 
numbers and vectors and matrices containing such numbers. Classical TMs 
as well QTMs can of course deal with all such objects. For example, rational 
numbers can be stored as lists of two integers (containing numerator and 
denominator), square roots can be stored as such lists supplemented with an 
additional bit to denote the square root operation, and, also, binary-digit- 
approximations can be stored as binary strings. Vectors and matrices are 
arrays containing those objects. They will be presented to the UQTM Y as 
vectors of the computational basis and operations on them, like addition or 
multiplication, will as easily be implemented as by classical computers. 


The instructions defining the quantum algorithm ® are as follows. 

1. Read n,r; find 2 € N such that 4. 23° < n < 2-€-23 with & a power of two 
(there is only one such £). Compute 7 := [4]. Compute R := r£. 

2. Compute a list of codewords or. belonging to a classical universal block code 
sequence of rate R. The construction of an appropriate algorithm can be found 
for instance in [168]. 
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Since 2 Cc ({0, D oN = {wi,we,...,war} can be stored as a list of 
binary strings. Every string has length ¢(w;) = nl and the exact value of the 
cardinality M ~ 2”? depends on the choice of a. 

3. Compute a basis {Agiz,..5in}} of the symmetric subspace 


SYM"(AM) := span{A®" : Ac AM}. 


Namely, for every n-tuple {i1,...,in}, where ip € {1,.. .,2°"}, there is one 
basis element Ajj, ,...iz} E Ar) given by 
= (én) 
Alinin} = 5 CS (its uin) ? (9.14) 


o 


where the summation runs over all n-permutations o, and 
(en) 


1th 


‘= a ® el? ®... Bel j 


with Tai a system of matrix units in A, In the computational basis, 
all entries of Such matrices are zero, except for one entry which is one. There is 
a number of d = (2 = dim( SY M"(A)) different matrices A{i;,...,i4} 
which we can label by {A;,}@_,. It follows from (9.14) that these matrices have 
integer entries and can thus be stored as lists of 2” x 2”‘-tables of integers 
without any need of approximations. 

4. For every i € {1,...,M} and k € {1,...,d}, let Juz) := Ax|wi), where |wi} 
denotes the computational basis vector which is a tensor product of |0)’s and 
|1)’s according to the bits of the string wi. Compute the vectors |ux,:) one after 
the other. For every vector that has been computed, check if it can be written as 
a linear combination of already computed vectors. (The corresponding system 
of linear equations can be solved exactly, since every vector is given as an array 
of integers.) If yes, then discard the new vector |ux,;), otherwise store it and give 
it a number. This way, a set of vectors flux) }P_, is computed. These vectors 


linearly span the support of the projector wip given in (7.157). 


5. Denote by NEAS the computational basis vectors of Hp-ze. If n = 02°", 
then let D := D, and let |x) := |ux). Otherwise, compute |urx}) @ |i) for every 
k € {1,...,D} and i {1,...,2"-™}. The resulting set of vectors {eE 
has cardinality D := D-2"-™. In both cases, the resulting vectors |a,) € Hn 
span the support of the projector Qe? =n. 

6. The set {lze} is completed to linearly span the whole space Hn. This will 


be accomplished as follows. Consider the sequence of vectors 


To = fey ufien} | 


where {Pp |i denotes the computational basis vectors of Hn. Find the smallest 
i—1 

i such that |%;) can be written as a linear combination of fi fj } , and 
j=1 

discard it (this can still be decided exactly, since all the vectors are given as 

tables of integers). Repeat this step D times until there remain only 2” linearly 


independent vectors, namely all the |£;} and 2” — D of the |®,). 
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7. Finally, apply the Gram-Schmidt orthonormalization procedure to the resulting 
an 
vectors, to get an orthonormal basis fim} of Hn, such that the first D 
k=1 
vectors are a basis for the support of Qe” = Pn. Since every vector |x;) and 
|;) has only integer entries, all the resulting vectors |y,) will have only entries 
that are (plus or minus) the square root of some rational number. 


Up to this point, the previous steps did not involve any kind of numerical 
approximation. Instead, the next ones will compute an approximate descrip- 
tion of the desired unitary decompression map U and apply it to the quantum 
state p. In view of Remark 9.1.3, the task is to calculate the number N of bits 
necessary to guarantee that the output will be within trace-distance 6 = 1/k 
of Ð. 


8. Read the value of k (which denotes an approximation parameter; the larger 
k, the more accurate the output of the algorithm will be). Due to the con- 
siderations above and the calculations below, the necessary number of bits N 
turns out to be N = 1 + [log(2k2"(10V2")?" )]. Compute this number. Then, 
compute the components of all the vectors (ua up to N bits of accuracy. 
(This involves only calculation of the square root of rational numbers, which 
can be done to any desired accuracy.) Denote the resulting numerically ap- 
proximated vectors by |.) and write them as columns into an array (a matrix) 
U := (91, %2,..., Gor). Let U := (y1,y2,-.., yor) denote the unitary matrix 
with the exact vectors |yx) as columns. Since N binary digits give an accuracy 
of 27", it follows that 


< 1/k 
2- 2n (10/27)2" ` 


|e. —Uij| < gN 


If two 2” x 2”-matrices U and U are e-close in their entries, they must be 
2” . e-close in norm, too. Whence we get 


1/k 
2(10/2")2” ` 
So far, every step could have been performed on a classical computer; the 


intrinsically quantum part starts when one consider the qubit string p, that 
is the input quantum program. 


IU - U|] < 


9. Compute [nr], which gives the length (p). Afterwards, move p to some free 
space on the input tape, and append zeroes, i.e. create the state 


p' = lvo) lyol == (|0)(0/) 2°" g7 


on some segment of n cells on the input tape. 

10. According to Remark 9.1.3, apply a unitary approximation to the unitary 
transformation U on the tape segment that contains the state p’, move the 
result onto the output tape and halt. 
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Proof of Proposition 9.2.1 The triple (n,r,@) can be encoded into a 
single qubit string o (note that the parameter k is not a part of ø) as follows. 
First, write both r and n in a self-delimiting way as computational basis 
vectors |8(r)), respectively |G(n)) (see Definition9.2.1), of length 2 log, r+ 2, 
respectively 2 logy n + 2,. 

Then, consider the projectors P, := |G(n))(G(n)|, P, := |G(r))(B(r)| and 
attach to them the rotated projector p = UÝ DU, so that the resulting input 
qubit string is o(p) := P, ® Pa ® p. If n fulfils (7.162), then 


l(a(p)) = 2|logyn|}+2+¢4+ [nr] , 


where C(r) € N is some constant which depends on C(r), but not on n. 

This qubit string is presented to the UQTM 4 together with a description 
of the decompression algorithm D of fixed length C’(r) which depends on r, 
but not on n. This will give a qubit string oy(p) of length 


L(oy(p)) = 2|log, n| + 2+ C(r) + [nr] +C'(r) 
< 2loggn +n (: + 5°) +C"(r) , 


where C” (r) is again a constant which depends on r, but not on n. The 
matrix U, whose construction is part of the decompression algorithm 9, 
rotates (decompresses) a compressed (short) qubit string p back into the 
typical subspace. Conversely, for every one-dimensional projector P < Pn, 
where Dn = Qe) was defined in (7.161), let p € H;,,) be the projector given 
by (0X0 pD @ p = UtpU. Then, since D is such that the trace- 
distance fulfils D(U(oulp), k), P) < ¢ for every k € N, it follows that 


1 l 1 g 
SE ES e teu TEN 
n n 2 n 


If n is large enough, then the first inequality in Proposition 9.2.1 follows, 
while the second inequality is proved by letting k := [5]. Then, for every 
one-dimensional projector p < Pn and n large enough 


1 1 1 2|log, k| + 2 
Lc? @ < Lact/* @ < Qc, p+ Hee +? 


2loggk+2 2 
n 


<s+064 


s+26, (9.15) 


where the first inequality follows from the obvious monotonicity property 
d6>e> Qc? (p) < QC; (p), the second one is by Example 9.2.1.2 and the 
third estimate is due to the first inequality in Proposition 9.2.1. 


Proof of Theorem 9.2.2 Let p,,(0) be the typical projector sequence 
1 
given in Proposition 9.2.1, i.e. the complexities —QC, (p) and =Qc? (p) of 
n n 
every one-dimensional projector p < p,,(d) are upperbounded by s + ô. 
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Due to Corollary 9.2.1, there exists another sequence of typical projectors 
a ae 1 : ; 
Tn(0) < pn(d) such that additionally, =Qc? (T) > s — 6(2 + 4)s is satisfied 
n 
for all one-dimensional projections m < 7,,(0). 
Also, from Corollary 9.2.2, there is another sequence of typical projectors 
1 
Tn(6) < m,(6) such that —QC, (7) > s — ô holds for all one-dimensional 
n 
projections 7 < 7,,(0). 
Further, the optimality of these upper and lower bounds, and thus of 


s as optimal expected asymptotic complexity rate, follows from applying 
Lemma 9.2.1 together with Proposition 7.3.1. 


Remark 9.2.4. Unlike in Theorem 4.2.1 where the result holds almost ev- 
erywhere, its quantum generalization given above essentially holds in proba- 
bility. The major obstruction to a stronger quantum version comes form the 
difficulty of extending to qubit strings what is natural for bit strings, namely 
their concatenation [60]. 


Example 9.2.2. Consider a quantum spin chain (A,w) of Bernoulli type 
with a state w which is the tensor product of tracial states p = 12/2 for each 
qubit; this quantum source is mixing, thus ergodic and its entropy rate is 
s = —Trp log» p = 1. Then, the quantum version of Brudno’s theorem states 
that there exists a sequence of subspaces K,, C Hp of high probability, such 
that for any £ > 0, by taking n sufficiently large, 


1 
1L-e< QO, (\¥)(¥|) site, 


for all qubit pure state VW € Kan. 


9.3 cbit Quantum Complexity 


A different approach to quantum algorithmic complexity is proposed in [309] 
where as effective descriptions of n-qubit strings |W) € Hn one chooses bit 
strings corresponding to self-delimiting classical programs p € 25 instead 
of generic qubit strings. These classical programs are presented to a fixed 
UQTM 4 as computational basis vectors |p) which, after being processed 
by U, outputs normalized vectors |U(p)) € H,,. Furthermore, the difference 
between the output | L(p) ) and the target |W) is taken care of by the scalar 
product (W | L[p] ). 


Definition 9.3.1 (bit Quantum Complexity). The bit quantum complex- 
ity QC, (W) of n-qubit vector states |W) € Hn is 


QC, (W) := min{ Kp) + [— 10g |v |Mlp})I?] } , 


where p € {25 is any self-delimiting binary program. 
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The logarithmic correction acts as a penalty for bad approximations: 
— log, |(¥ | U[p] )| diverges for an effective description of ¥ which yields a 
vector orthogonal to it, while it vanishes when | L(p)) ~ |W). Therefore, the 
bit quantum complexity results from a tradeoff between the length of the 
classical description and the permitted errors. 


Example 9.3.1. Let a vector ¥ € H, be called directly computable if there 
exists a self-delimiting program p € $23 such that |U(p)) = |W). Then, 
consider an orthonormal basis B := {|b;)}?", in H, entirely consisting of 
directly computable vectors. Let K(B) denote its classical prefix-complexity 
achieved by a self-delimiting program qg, K(B) = (qg). Let us fix | b;) € B; 
if p; is any program such that | LU(p;)) = |b;) then no penalty for a bad 
approximation is to be payed and, with p, the shortest among such programs, 


QC, (bi) < Up). C) 
On the other hand, let QC, (b;) be attained at qx € 23, namely 


QC. (bi) = Kaa) + |- 10ga (Ua) 1b:)1?] 


By letting & process the binary programs in dovetailed fashion (see Re- 
mark 4.1.5), qx can be used to construct the vector | U(q.)) € H, whose 
coefficients (b; | L(g.) ) in the expansion with respect to the ONB B provide 
probabilities |( b; | U(q.) )|? that can be used to construct a Shannon-Fano- 
Elias code-word q(i) for |b; ) (see Example 3.2.3). Therefore, qg, qx and q(i) 
can be used to construct a self-delimiting program q = qgq.q(t) such that U 
does the following: 


— it constructs the directly computable basis 6 and the vector | U(q..) ); 
— it computes the Shannon-Fano-Elias code for B with respect to | U(q.) ); 
— it outputs the vector with code-word q(i). 


Since | U(q)) = | bi), from (x) one gets 
E(px) < Lla) < Elas) + (g(t) + KB) +C = QC, (bi) + KB) +C, (+) 
whence, up to an additive constant, 

QC, (P) = minf ep) : |) = 14) 


for all W belonging to a directly computable ONB . 


The preceding example can be used to show that bit quantum complexity 
and classical prefix complexity agree on bit strings. 


Proposition 9.3.1. For all i € 23, QC. (|i)) = K(i) up to an additive 
constant. 
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Proof: Choosing the computational basis {|i )},m) € QA”) as the di- 
rectly computable ONB B of the previous example, the result follows from 
(xx) because the shortest program that tells 4 how to generate B is now such 
that (qg) = O(1). 


For generic qubit strings, a loose upper bound is easily obtained. 


Proposition 9.3.2. /309] If V € Hn is normalized 
QC, (W) < 2n +C, 


where C is a constant independent of W. 


Proof: Consider the computational basis vectors | a ) € Hn; by expand- 

ing |P) = Vimegm c(i™)|i™ }, there must be at least one i!” such 
2 

that |c(i™)|2 > 27”. Let p € Q% be a self-delimiting program such that, by 

literal transcription, |U(p))} = | 4”). Then, with such a choice of effective 

description of |W) one gets the upper bound 


QC, (Y) < Kp) + |- log: (U0) |v)? | <2n +C. 


A lower bound to the bit quantum complexity of a subset of W € H, can 
be obtained following an argument developed in [119]. For any ¥ € H,, and 
a > 0, let us define the subsets 


5 2 Mal¥) := {p € Q3 : — logy (Up) |P)? <a} 


and the quantities QC,(W) := min{f(p) : p € Ha(¥)}. If a > 8, then 
ITg(W) C Ha(Y), whence 

a > P => QCg(H) > QC) > QC) - 
Notice that QC,,(W) is the length of the shortest classical programs p such 


that | U[p]) is non-orthogonal to |), |(U(p) | ¥ }| > 0. Therefore, if QC, (X) 
is attained at q, that is if 


QC, (X) = (a) + [= 1083 (UC) ZIP] 
— 
then, (q) > QCg(W) > QC.(W) for all a > 8 and 
QC, (W) > QC.) + 8. (9.16) 


The following Lemma shows that there are vectors Y € H, for which QC,,(¥) 
cannot be small. 
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Lemma 9.3.1. Let K(d) C Hn be a d-dimensional subspace; then, for all 
0 <a < log, d there exists a subspace Ka C K(d) of dimension da > d — 2° 
such that QC,,(W) > a for all Y € Ka. 


Proof: According to (4.6), there are less than 2° programs p € $23 with 
L(p) < a; it follows that the subspace H(a) linearly spanned by the corre- 
sponding vectors | {[p]) has dimension < 2°. Let K(d) C H, be any subspace 
of dimension d > 2“ and choose K, C K(d) orthogonal to H(a) and thus of 
dimension da > d — 2%. Now, |W) € Ka satisfies QC,,(W) > a, unless there 
is a program p with (p) < a with (U(p) |W) 4 0; this is impossible since, by 
construction, |%) is orthogonal to the linear span of | U[p]) with (p) < a 


Unlike for the qubit quantum complexity QC, where the corresponding 
complexity rate could be controlled by means of high probability subspaces, 
in the case of the bit quantum complexity QC., one has to argue in terms of 
volumes of vectors. Indeed, one can estimate how many unit vectors |) € Ka 
satisfy QC, (W) < r. This will be done by representing W as a point u € R24 
on the unit sphere Sq, whose coordinates are the real and imaginary parts 
of the Fourier coefficients of the expansion of |W) with respect to a chosen 
ONB in the subspace Ka. 

Let Soa, (0) denote the area of the sector of Sq, consisting of unit vectors 
u € R4 which have scalar product 1 > u- e > cos with respect to a fixed 
vector e; it is expressed by 


0 
~E J dé Ava, -1(sing) , 
0 


where 
2r” /2 


P(n/2) ° 

with T(z) the Euler Gamma function, is the area of the unit sphere Sn in 
R” of radius t (notice that area of the unit sphere and area of the sector of 
angle 0 are related by An(1) = S;,(7)). The sector area can be bounded from 
above as follows: 


Igda —1/2 2gda—1/2 3 
j (da—1) 
Soa, (0) = Tla — 1/2) 7h do sin?! 6 < Tla = 1/3) sin 0. 
(9.17) 


Let S24, (0) denote the sector area S24,(0) normalized to that of the unit 
sphere, Aga, (1); then, 


Ar 


a Soa, (9) antes T(da) -aqai 

0) = P (da—1) Q 
Sra, (0) Aga, (1) 7 T(da — 1/2) 2742 om 
< dae (e-NG—#/2)" (9.19) 


(9.18) 


where the last inequality comes from expanding f(@) := logsin 0 around 7/2, 
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FO) = -5(0-0/2)?+ 2" O (0-1/2) < -Z0-m/29?, 0B n/2, 


and from the fact that f” (0) > 0. We now use (9.19) to estimate the relative 
volume, F,(p,a), of the subset 


Fa(p,a) = {|¥) € Ka : logs (1p) |4)? < a} 


consisting of vectors with penalty smaller than a with respect to a given 
output | U[p] ). Since 2-°/? < |(U[p] | y )| = cos 6 = sin(m/2 — 0) < 1/2 — 9, 


Loe) < da e7792 . 
From this inequality we further deduce 
Lemma 9.3.2. The relative volume, F? (a), of the set 
F(a) = flp) Ke : QCA(¥) <r} 
has relative volume F? (a) such that 
Fi (a) < da 2” ee. 
Proof: If |) € Kg is such that QC, (Y) < r, then — logs |(w|LU[p])|? < a 


for at least one program p with (p) < r; the result then follows since there 
are < 2” such programs. 


The complement G™(a) of F? (œ) consists of |Y) € Ka such that either 
— logs |(w|U[p])|? > a or — logs |(w|LU[p])||? < a, but &(p) > r. In other 
words, from (9.16) and Lemma 9.3.1, it turns out that G?(a) consists of 
|Y) € Ka such that 


QC. (Y) > QC (ZY) +a> ata (9.20) 


or 
QC, (4) = r — loga |(| Up] )||? > r- (9.21) 


Notice that the relative volume G? (a) of G7 (a) is large, Gi (a) > 1 — e if the 
relative volume of F? (a) is small, F(a) < €. Lemma 9.3.2 can then be used 
to prove that for a large fraction of vectors |Y} € Ka one has QC, (Y) = 2n 
when n — oo. 


Proposition 9.3.3. For any e > 0 and N 3 n large enough, there exists 
a subspace Kn—ı C Hn of dimension > 2-1! containing a subset Gn_1 of 
relative volume Gn—ı > 1 — € such that, for all |Y) € Gri, 


„< QC) 


n 


2 <2+e. 
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Proof: Choose H(d) = H, in Lemma 9.3.1 and a = n—1; then, there exists 
a subspace K,_1 C H, of dimension d,_; > 2” — 27t = 2”~! such that 
QC,,(Y) > n—1 for all Y € K,_1. Setting r = 2n and a = n — 1 — 2log n 
in Lemma 9.3.2 one gets 

F??? (n—1-— 2log n) < e7(1-27"+!)n? + (3n—1)log2 
Thus, for n sufficiently large, the subset Gn-1 C Kn-1ı of Y € Kn-ı that 
violate QC,,_1~2 10g, n(¥) < 2n has relative volume Gn—1ı > 1 — e. The result 


then follows by applying the lower bound in Proposition 9.3.2 and the upper 
bounds (9.20) and (9.21). 


Remark 9.3.1. Proposition 9.3.3 states that a large fraction of n-qubit vec- 
tor states belonging to a subspace of dimension not less than 2”~! has a bit 
quantum complexity per symbol close to 2. Notice that this is twice the qubit 
quantum complexity per symbol of all pure states in the high probability sub- 
spaces of a Bernoulli quantum source (see Example 9.2.2). However, in the 
latter case the fact that the complexity rate ~ 1 follows from the specific 
structure of the state w on the quantum spin chain A. Instead, the result 
of Proposition 9.3.3 does not refer to the considered n qubits belonging to a 
quantum chain and thus to a reference global state w. Indeed, the weights of 
the subsets of vectors with bit quantum complexity rate ~ 2 are estimated 
in terms of relative volumes instead of probabilities as in Theorem 9.2.2. 


Circuit Algorithmic Complexity 


Any state |W) € H, of n qubits can be obtained as the result of an action 
on a fixed state |) € Hn by a suitable unitary operator U : H, + Hy. 
From Remark 9.1.1.2 we know that the action of U can be approximated 
within any £ > 0 by means of a quantum circuit, that is by another unitary 
operator V : H, +> Hn, consisting of N(U,¢) gates from a complete gate 
basis G. Furthermore, to leading order in the number of qubits and of the 
accuracy €, the number of gates scales as N(U,¢) = o(2" log a 

This fact is essential in the definition of quantum alconi complexity 
proposed in [205, 206, 207] where the focus is not on the effective description 
of the n-qubit states, whether quantum or classical, rather on the effective 
description of the quantum circuits that can be used to effectively construct 
those quantum states up to a certain fixed accuracy €. 

Given a complete gate basis G, the fixed ready state |®) and the accuracy 
parameter € > 0, a same |W) can be reached up to £ by a certain set CF 
of quantum circuits Ve that will be identified with their unitary actions 
on |}. Let the description of any of these circuits be encoded by a binary 
string tyg.: € $25; then 
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Definition 9.3.2 (Circuit Quantum Complexity). Let ne C 25. be 
the subset of strings that encode the circuits in ef. the circuit quantum 
algorithmic complexity of an n-qubit state |W) € Hn is the least classical 
prefix algorithmic complexity of the strings in QF: 


QCSE (W) := min {K(éyg.) : ivan € oP 


Remark 9.3.2. The dependence of QCy¢ on the encoding of the descrip- 
tion of the circuits vg Eg egs can be handled as in classical algorithmic 
complexity: a change of code is taken care of by a finite additive constant 
corresponding to a suitable dictionary which is thus independent of the circuit 
described. 


The physical motivation behind such a definition is that, after all, quan- 
tum states can be prepared by means of arrays of unitary gates that can be 
effectively described; then, the idea is to relate the complexity of vector states 
to the degree of compressibility of the descriptions of the quantum circuits 
that provide suitable approximations to them. 


Example 9.3.2. If one want to reproduce a bit string i" by a quantum 
circuit, the first step is to associate it to a qubit vector state | i‘) ) of the so 
called computational basis (see Section 4.1.1), where 


|i) = |i) 8i) 8 lin), 47 =0,1, oslty)= (Daj) - 


This state can then easily be obtained by flipping with o} the j-th qubit 
of |0)®". The corresponding quantum circuit consists of n 1-qubit gates, 
either trivially the identity matrix lə or the Pauli matrix a; therefore, an 
upper bound to the algorithmic complexity of the classical description of such 
a quantum circuit is easily seen to scale as n, exactly as the Kolmogorov 
complexity of a generic bit string of length n (see Proposition 4.1.1). 


Within this approach one usually estimates the complexity by upper 
bounds that depend on results as the one quoted in Remark 9.1.1.2, whence 
the circuit complexity of a state |W) of n qubits can be estimated as follows: 


QCH (YW) = O(n?2" log 1/e) , (9.22) 
where f(n) = O(g(n)) if there exists Cp g > 0 such that |f(n)| < Cr glg(n)]. 
Remark 9.3.3. As already pointed out (see for instance Example 3.2.2), a 


bit string i™ of length n can be associated with an interval in [0, 1] of length 
27”; this latter can also be interpreted as the volume of the subset V(i'”) 
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of bit strings of any length that are prefixed by oT herefore, the upper 
bound (4.5) to the algorithmic complexity of i™ scales as — log V (i™) = n. 
In a quantum context, because of the lack of discreteness, given a fixed n- 
qubit vector W, one can in general only hope to construct it within an error 
£; namely, a quantum circuit can be devised that outputs |Y) € (C?)®” such 
that |(w|W)| > 1—e for some accuracy parameter 0 < e < 1. Using (9.17), 
one finds that the logarithm of the volume V.(W) of such a cone scales as 
2” loge in agreement with (9.22). 

Notice that the upper bound to the bit quantum complexity in Proposi- 
tion 9.3.2 is obtained by choosing |% )} such that |(w~|W)| > 27”; this cor- 
responds to a parameter in (9.22) which scales as e ~ 1 — 27” and to an 
upper bound to QCSE (W) which is only polinomially different from the one 
to QC, (W) [207]. 


By means of the upper bound (9.22), it is possible to put into evidence 
the difference in circuit complexity between separable and entangled states. 


J 
The important point is that a product state |W) = Q |;) € H, can 
j=l 


be constructed with accuracy € by means of n circuits that construct the 
vectors | ®; ) with accuracy ¢/n [205]. Suppose that the state | W } shows some 
entanglement between its constituent qubits; namely, |W) = Qa |8; ), 


where Da nj =n and |8; ) € H,,, are entangled states of n; qubits. Then, 
one obtains 


J 
QcS: (Y) = oD n22" log a . 
Z 


For sufficiently small £, the sum is upper bounded by 


J 
J J J 1 
29Nnj 2 29n ea 29N = 
a log = S n 2a log z S n2 log z. 
j= 


Therefore, the upper bound is the largest when the state |W) is completely 
entangled, that is when J = 1 and nj = n; vice versa, in the case of complete 
separability, namely when J = n and n; = 1, one gets 


QCR (V) = O (2n10g =) , 
E 
with polynomial instead of exponential increase with the number of qubits. 
y 


Quantum Universal Semi-Density Matrix 


Like in Section 9.3, the quantum extension of classical algorithmic complex- 
ity proposed in [119] starts from the classical description of quantum states 
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|W) € Hn of n qubit systems by means of bit strings i € NZ. However, 
these descriptions are not considered as programs of a certain length which 
has to be minimized, rather as bit strings characterized by a given universal 
probability Py as explained in Remark 4.3.2.3. 

For instance, if a state |W) can be expanded with respect to the com- 
putational basis {| 4”) ) tie eqe by means of coefficients which are exactly 
computable by a program j € Q5 processed by a fixed UTM 4, then 


m(Y) = Pali) 


naturally represents the universal probability of this state. Exactly com- 
putable states are termed elementary as well as linear operators X on Hp 
whose matrix elements with respect to the computational basis can be exactly 
computed; operators which can be approximated from below by an increas- 
ing sequence of elementary operators are called lower semi-computable (see 
Definition 4.1.4). Then, an argument similar to the one in Example 4.1.7.3 
leads to the following result [119]. 


Theorem 9.3.1. A lower semi-computable semi-density matrix p € Bi(H,), 
namely p > 0 and p < 1, can be effectively constructed such that, for any 
other semi-computable semi-density matrix o € B,(H,,), there is a constant 
Co for which Coo < p. Moreover, p can be identified with 


P= 5 M(Ver) | Ver) (Ver | ’ 


| Per ) EHn 


where the sum runs over all elementary vector states of n qubits. 


The operator p is a convex combination over elementary projections 
weighted with their universal probabilities; since the universal probability 
Py is not normalized, neither is p. Inspired by Remark 4.3.3, it is thus sug- 
gestive to introduce an operatorial complexity 


k := — log2P , 
and two possible definitions of algorithmic complexity of a state |W): 
QCrp.(Y) = —log,((¥|plZ)), = QCL, (W) = (Y| K |Y). 
From the the concavity of the function f(x) = log, x it follows that 
QCr yp. (Y) < QCT, (P) . 


Indeed (see the proof of Proposition 5.5.3), with p = >0, ri|ri)(ri| the 
spectral decomposition of p, 
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logs((¥ | p|¥)) = logs (= ri (ein?) 


i 


> SEF |r)’ logar: = (¥ | logs pl) . 


For bit strings i € 925 the two possibilities coincide with the algorithmic 
complexity KG”), Indeed, consider the computational basis vectors i”), 
then {(i™ | p|i™ eemen is a semi-measure. As m is a universal semi- 
measure on 23 it follows that there exists a constant C, such that 


Ca” | ph) < ma) , 


whence, from Remark 4.3.3, QCz,,, (i) > K(i™) + O(1). 

On the other hand, p = Limes m(i™) |4™ )(4™ | is a lower semi- 
computable, semi-density matrix. Therefore, the monotonicity of the loga- 
rithm as an operator function (see Example 5.2.3.6) yields 


Cpp < p= -logs p + log2Cp 2? K, 


whence QC, (™) < K(i™) + O(1). 

The operatorial complexity « has an interesting similarity with the clas- 
sical algorithmic complexity in that its mean value with respect to a lower 
semi-computable density matrix p equals its von Neumann entropy up to an 
additive constant [119] (compare Corollary 4.3.2) 


Tr(p&) = S2(p) + O(1), S2(p) := —Tr(plog, p) . 


Setting p := TOR the positivity of the relative entropy, S (p, p) > 0, yields 


So(p) < Tr(pk) + log, Tr(p) . 
By assumption, there exists a constant C, such that Cp p < p; thus, as before, 
— logs Cp — logy p > k => S2(p) > Tr(px) + O(1) . 


Remark 9.3.4. The topics addressed in this chapter are relatively recent 
and still in their infancy so that the relations between the various extensions 
of classical algorithmic complexity theory to quantum systems are largely to 
be explored (a discussion of those between Vitanyi’s and Gács’ proposals can 
be found in [119]). 

Further, beside the previous result and Theorem 9.2.2, the connections 
between quantum algorithmic complexities and the von Neumann entropy or 
the von Neumann entropy rate have not yet been clarified. In particular, the 
randomness of the quantum dynamics, rather than of quantum states have 
not been tackled yet; namely, a quantum extension of the dynamical version 
of Brudno’s theorem (see Corollary 4.2.1) is still missing. 
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embeddings, 158-159 
N-positive, 159 
positive, 157-159 
decomposable, 262-263 
Markov approximations, 243 
Martin-Lof tests, 106, 128 
Master equation, 243, 245, 251, 283 
Maximal accessible information, 297, 
299 
Measurement processes, 139, 236-238, 
412-413, 455 
Measures 
o-algebras, 10, 77-78 
absolutely continuous, 34, 35, 55, 56, 
328, 355, 442 
Lebesgue decomposition, 34 
mutually singular, 34 
product, 24, 29, 49, 78 
Minmax principle, 214, 217 
Mixing systems 
classical, 40—47 
K-mixing, 46, 51, 81, 352-353, 357, 
445 
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weakly mixing, 46 
quantum 
hyperclustering, 352 
strongly mixing, 352, 355, 356 
weakly mixing, 352-354, 356, 358, 
376 
Modular theory, 212, 332 
KMS conditions, 330-335 
modular automorphisms, 232, 335 
modular conjugation, 212, 332, 333 
modular group, 233, 236, 332, 335 
modular operator, 213, 232, 333, 447 
relative modular operator, 287—290, 
332 
Mutual information, 63, 67—69, 
100-101, 295, 477 
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Neighborhoods 
strong, 31, 141, 169 
uniform, 30-31, 140-142 
weak, 32, 141 
Non-commutative deformations, 139, 
231, 344, 388 
Norm 
C*, 144 
sup-norm, 38 
uniform, 33, 140, 143-145, 152, 182, 
185 
Number operator 
Bosonic, 189, 325, 326-327 
Fermionic, 181, 325-326 
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Observables 
functions, 9-10 
local, 323, 343, 347, 351 
Occupation number states 
Bosonsic, 189 
Fermionic, 232 
One-way quantum computation, 315 
Open quantum systems 
dynamical semigroups, 247-248 
Kossakowski-Lindblad generators, 
250 
Kossakowski matrix, 247, 249, 
280-281 


reduced dynamics, 227, 242-243, 251 
Operational partitions of unit (OPUs), 
451-452 
density matrices, 452—453 
refinement of, 451, 471 
time-evolution, 451, 452 
Operations 
local, 253, 259, 267-268, 277, 278 
LOCC, 258, 267-270 
non-local, 223 
Operators 
bounded, 140-141, 148, 144, 146, 
167, 170-172, 182, 323, 327, 479 
compact, 152, 167, 170 
finite rank, 152, 170 
Hilbert-Schmidt, 155-156 
isometric, 12, 147 
partial isometries, 148-149, 311, 492 
polar decomposition, 148-149, 153, 
321 
singular values, 149, 153, 155-156 
tensor product of, 141, 145, 180, 228 
trace-class, 152-155 
unitary, 12, 15, 53, 140, 147, 178, 184, 
185, 203, 208, 226, 227, 231, 239, 
256, 258, 307, 320, 330, 335-338, 
340, 343, 347, 357, 396, 397, 404, 
465, 485, 488-490, 502, 511 
Weyl, 184-185, 187, 189, 202-203, 
208, 229-230, 236, 277, 327, 
337-339, 404, 447, 470, 477 
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Partition functions 
Bosonic, 235 
Fermionic, 234 
Partitions, 25-27 
entropy rate, 73-75 
generating, 50-52, 76, 79-81, 93, 94, 
127, 428, 436 
tail, 51-52 
Partitions of unity, 238, 451 
Pauli matrices, 179-180, 226, 228, 246, 
250, 259, 260, 272, 282, 285-286, 
319, 350, 373 
Probability 
empirical, 96, 125 
Probability distributions 


conditonal, 35, 60, 63, 157, 190-191, 
198, 298-299, 389-391, 417 

joint, 58, 62, 63, 67 

marginal, 58, 62, 414, 416 

Projectors, 37 

minimal, 37, 39, 151, 297, 420—421 

orthogonal, 139, 140, 190, 240, 293, 
370, 395-396, 468, 500 
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Quantum 
circuits, 223, 484, 511-512 
gates, 223, 256, 258, 490 
noise, 249 
universal semi-density matrix, 484, 
513-515 
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Radon-Nikodym derivative, 34-35, 55, 
56, 164 
Random sequences 
chaoticness, 105, 129, 484 
stochasticness, 105-106 
typicalness, 106, 129, 484 
Random variables, 57-58, 60, 62-68, 
71, 90, 99, 100, 225, 295, 421, 425, 
426 
Reduction map, 161, 164 
Regular motion, 14-15 
Relative entropy 
classical, 96 
quantum, 221, 287 
joint convexity, 289-292 
monotonicity under CPU maps, 
498-499 
Relaxation to equilibrium, 56, 251, 
317-318 
Representations 
Fock, 183, 325, 326 
GNS, 170, 172, 175, 210-213, 300, 
320-321, 331, 332, 335-336, 338, 
343, 347, 350, 351, 357, 456, 463, 
474-478 
momentum, 184, 188, 192, 328 
position, 183, 184, 192, 198-199 
thermal, 332, 334 
Resolvent, 146 
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Scalar product 
Hilbert-Schmidt, 155, 161, 179, 247, 
287, 290-291 
Semi-computability, 118-120, 132, 133, 
514, 515 
Semi-computable functions, 120 
Semi-norms, 31, 32, 141, 143, 155, 173, 
208 
Sesquilinear form, 154, 173, 200-202, 
342 
Shift dynamical systems 
Bernoulli, 24-25 
Markov, 24—25 
Spectrum, 46, 47, 53, 146, 148, 149, 
152, 158, 174, 176, 177, 182-183, 
213-215, 218, 221, 227, 237, 263, 
328, 332, 355, 359, 392, 393, 437, 
463, 464 
Spin chains 
classical, 37, 39-40, 53, 180, 363 
quantum, 2, 347, 381—405 
States 
coherent, 191-193, 200, 235, 481 
cyclic, 66, 153, 168, 172, 178, 186, 
203, 212, 213, 220, 232, 237, 299, 
301, 324, 330, 332, 334, 335, 338, 
343, 349-350, 355, 403 
entangled, 222-226, 246, 253, 
258-260, 262, 263, 266-271, 273, 
281, 383, 485, 513 
equilibrium, 7, 12, 14, 56, 71, 227, 
232, 234, 235, 293, 330 
expectation functionals, 33, 139, 170, 
320, 343 
factor, 350, 351, 354, 363 
faithful, 212, 299-300, 330, 332, 357, 
443—444 
finitely correlated, 366-372, 438 
Gaussian, 188—210, 276-279, 314-315, 
329 
Gibbs, 232-234, 292, 330 
global, 22-24, 38, 40, 362, 369, 441, 
511 
KMS, 232, 233, 330-332, 354, 375 
local, 22-25, 28, 39, 253, 363, 367, 
376, 378, 381, 383, 390, 393, 395, 
431, 438, 467 
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261, 266-267, 269, 271, 291, 376, 
404, 433, 434, 456 
mean values, 11-14, 33, 34, 40, 41, 
44, 90, 107, 139, 157, 160, 190, 
237, 277, 515 
mixed, 190, 210, 213, 216, 225, 241, 
267, 270, 294-295, 312, 399, 456, 
486 
normal, 334, 335, 376, 433 
NPT, 263, 268-269 
phase-points, 11, 12, 139, 274 
positive functionals, 33-34 
PPT, 263, 264, 268, 269, 271, 278, 
284-286 
PPT entangled, 263, 264, 268, 269, 
271, 286 
probability distributions, 12 
pure, 173-176, 190-191, 209, 211, 
213, 216, 219, 223-229, 237, 238, 
244, 266, 267, 269-270, 272, 295, 
302, 303, 313, 314, 350, 351, 385, 
420, 455, 456, 459, 497, 501, 506, 
511 
entangled, 266-267 
purification, 210 
quantum 
convex decompositions, 294, 414, 
448 
quasi-free, 327, 329-330, 378, 442 
separable, 224—225, 246, 262, 265-266 
separating, 168, 172, 178, 212, 213, 
247, 301, 332, 334-335, 350 
shift-invariant, 2, 25, 364, 439 
space of 
convex structure, 173, 190 
symmetric, 151, 158, 225, 259, 265 
tracial, 174, 175, 232, 266, 293, 298, 
321, 359, 373-375, 421, 447, 450, 
468-470, 506 
Stationary couplings, 433-436, 441 
Stochastic matrices, 25 
Strings 
bit, 256, 485, 492, 512-513 
qubit, 385-387, 484-486, 494, 497, 
501, 506-508 
Symbolic dynamics, 25—40 
Symbolic models 


classical, 295, 399, 404 
quantum 
OPUs, 451, 452-454 
Symplectic 
form, 18, 337 
matrix, 13, 205, 207, 278-279 
structure, 13 
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Tensor products 
of algebras, 37-39 
of projectors, 37 
Theorem 
AEP 
classical, 91-95 
quantum, 394 
Birkhoff ergodic, 44—45 
Brudno 
classical, 123-127 
quantum, 497—498 
Kolmogorov-Sinai, 76, 428 
Kraus, 163, 227 
Liouville-Arnold, 16 
no-cloning, 258, 260 
noiseless coding, 95, 384 
Shannon-Mc Millan-Breiman, 93, 94, 
387, 389, 411 
Stinespring, 162, 163, 367 
von Neumann bicommutant, 169 
von Neumann ergodic, 47 
Time-evolution 
continuous, 9, 14, 40, 41 
discrete, 9, 13, 14, 16, 21, 27, 33, 41, 
50, 51, 76, 342, 465, 489 
irreversible, 9, 241 
reversible, 9, 241-242 
Topology 
strong, 31-32, 141, 142, 169, 170 
uniform, 30, 31, 140-142, 208 
w*, 143, 208 
weak, 32, 47, 141-142, 155, 166, 208 
o-weak, 141, 143, 155, 208-209 
Totally symmetric projector, 151 
Totally symmetric vector, 159 
Trace, 38, 152-156, 161, 165, 190, 199, 
202, 214-216, 219, 221, 224, 227, 
232, 235-237, 241-242, 244, 249, 
253, 262, 268, 282, 283, 288-290, 


292, 310-314, 322-324, 340, 363, 
365, 366, 375, 381-384, 387, 397, 
403, 413, 434, 479, 486, 492—496, 
498, 499, 504 
partial, 242, 479, 490 
Trace map, 156, 161, 165, 282 
Transition probabilities, 25, 39, 48, 49, 
57—59, 61, 99, 111-112 
Transmission rate, 404—405 
Transposition, 140, 158, 159, 162, 163, 
211, 261-263 
partial, 158, 263-269, 277, 285 
mirror reflection, 277, 278 
Triplets 
classical C* algebraic, 39 
measure theoretic, 29, 33, 34, 39 
quantum algebraic, 332 
Turing machines 
classical, 108, 109 
prefix, 107, 127 
probabilistic, 486 
transition functions, 109-111 
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universal, 108, 492 
quantum, 3, 255, 486-487, 491—492 
universal, 492 
Types, 96, 321-322, 340, 406 
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Uncertainty relations, 196, 197, 201, 
317 
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Vacuum state, 181, 182, 191, 322, 332, 
336 
Bosonic, 189 
Fermions, 182 


WwW 


Wave-packet reduction, 237—239, 241 
Weyl relations 
continuous variables, 185, 186 
discrete, 187, 231 


